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20.  Abstract  -  continued 

The  subjects  were  9b  Johns  Hopkins  University  undergraduates.  Art  men  and  Art 
women.  They  were  assigned  to  AS  two-person  teams.  One  team  member,  the  source 
was  given  a  completely  assembled  abstract  wooden  model  and  was  required  to 
assist  the  other  member,  the  seeker,  who  had  to  build  an  identical  model  from 
the  separate  parts.  The  two  team  members  were  in  different  rooms,  and  all 
communication  between  them  was  via  teletypewriters. 


Twelve  of  the  teams  were  all  male,  twelve  were  all  female,  twelve  had  a  male 
source  and  female  seeker,  and  twelve  had  a  female  source  and  male  seeker. 

Half  of  the  teams  of  each  type  were  penal ired  for  each  word  they  used;  the 
other  half  were  not.  Finally,  on  half  of  all  teams  the  two  participants  were 
Introduced  to  each  other  beforehand;  on  the  other  half  they  were  not  introduced 
anil  knew  nothing  of  each  other  save  what  thev  could  deduce  from  their  tele- 
tvpewriter  conversation.  This  variable  is  referred  to  as  the  anonymity 
variable . 

) 


Performance  was  assessed  on  four  classes  of  dependent  measure:  (1)  the  time 
required  to  assemble  the  model,  (2)  various  measures  of  verbal  productivity, 

(1)  experimenter  tabulations  of  various  types  of  sub jeet-to-sub lect  information 
requests,  and  (A)  responses  to  a  questionnaire  completed  by  the  subjects  at  the 
conclusion  of  their  session. 


N 


The  only  significant  differences  attributable  to  sex  were  that,  on  the  average, 
teams  with  a  female  source  were  faster,  used  fewer  words,  and  generated  fewer 
of  certain  types  of  information  requests.  There  were  no  significant  differ¬ 
ences  attributable  either  to  the  main  effect  of  sex  of  seeker,  or  to  the 
interaction  of  sex  of  seeker  with  sex  of  «ource.  Further,  there  were  no 
significant  differences  attributable  either  to  the  main  effect  of  anonymity  or 
to  the  Interaction  of  anonymity  with  any  other  variable.  Therefore,  this 
study  found  no  Important  differences  between  men  and  women  that  would  need  to 
be  taken  into  account  when  building  a  natural  language  problem  solving  computer 

J 

There  were  several  highly  significant  differences  between  the  taams  who  were 
unlimited  and  those  who  were  required  to  limit  their  word  usage.  The  most 
interesting  of  these  is  that  the  limited  teams  assembled  the  model  more  quickly 
than  did  the  unlimited  teams.  There  were,  however,  no  significant  differences 
attributable  to  the  interaction  of  this  word  usage  factor  with  any  other 
var  table . 
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Abst  rac t 


It  is  well  documented  that  men  and  women  differ  significantly 
in  several  important  psychological  respects.  The  pattern  of  these 
differences  suggests  the  possibility  that  when  a  natural  language 
problem-solving  computer  is  built,  it  might  have  to  adjust  its  lan¬ 
guage  and  problem-solving  strategy  to  match  that  of  the  sex  of  its 
user.  That  possibility  was  examined  in  the  experiment  reported  here. 

The  subjects  were  96  Johns  Hopkins  University  undergraduates, 

48  men  and  48  women.  They  were  assigned  to  48  two-person  teams.  One 
team  member,  the  source,  was  given  a  completely  assembled  abstract 
wooden  model  and  was  required  to  assist  the  other  member,  the  seeker, 
who  had  to  build  an  identical  model  from  the  separate  parts.  The  two 
team  members  were  in  different  rooms,  and  all  communication  between 
them  was  via  teletypewriters. 

Twelve  of  the  teams  were  all  male,  twelve  were  all  female, 
twelve  had  a  male  source  and  female  seeker,  and  twelve  had  a  female 
source  and  male  seeker.  Half  of  the  teams  of  each  type  were  penal¬ 
ized  for  each  word  they  used;  the  other  half  were  not.  Finally,  on 
half  of  all  teams  the  two  participants  were  introduced  to  each  other 
beforehand;  on  the  other  half  they  were  not  introduced  and  knew  noth¬ 
ing  of  each  other  save  what  they  could  deduce  from  their  teletype¬ 
writer  conversation.  This  variable  is  referred  to  as  the  anonymity 
var  iab le . 

Performance  was  assessed  on  four  classes  of  dependent  measure: 
(1)  the  time  required  to  assemble  the  model,  (2)  various  measures  of 
verbal  productivity,  (3)  experimenter  tabulations  of  various  types  of 
subject-to-sub ject  information  requests,  and  (4)  responses  to  a  ques¬ 
tionnaire  completed  by  the  subjects  at  the  conclusion  of  their  session. 

The  only  significant  differences  attributable  to  sex  were  that, 
on  the  average,  teams  with  a  female  source  were  faster,  used  fewer 
words,  and  generated  fewer  of  certain  types  of  information  requests. 
There  were  no  significant  differences  attributable  either  to  the  main 
effect  of  sex  of  seeker,  or  to  the  interaction  of  seeker  with  sex  of 
source.  Further,  there  were  no  significant  differences  attributable 
either  to  the  main  effect  of  anonymity  or  to  the  interaction  of  anonym¬ 
ity  with  any  other  variable.  Therefore,  this  study  found  no  important 
differences  between  men  and  women  that  would  need  to  be  taken  into 
account  when  building  a  natural  language  problem-solving  computer. 

There  were  several  highly  significant  differences  between  the 
teams  who  were  unlimited  and  those  who  were  required  to  limit  their 
word  usage.  The  most  interesting  of  these  is  that  the  limited  teams 
assembled  the  model  more  quickly  than  did  the  unlimited  teams.  There 
were,  however,  no  significant  differences  attributable  to  the  interac¬ 
tion  of  this  word  usage  factor  with  any  other  variable. 


v 


Introduction 


At  dawn  on  the  morning  of  October 
commanded  by  Lord  Horatio  Nelson  from  his 
sighted  the  combined  fleets  of  France  and 
tant.  What  happened  next  is  described  in 
Nelson  (Churchill,  1808): 


21,  1805,  the  British  fleet, 
flagship  H.  M.  S.  Victory , 
Spain  about  ten  miles  dis- 
a  contemporary  biography  of 


About  twenty  minutes  after  seven  the  enemy,  who  were  in 
very  compact  order,  wore  in  succession,  and  stood  on  the  lar¬ 
board  tack  with  their  heads  toward  Cadiz.  They  kept  a  good 
deal  of  sail  set,  steering  about  two  points  from  the  wind, 
with  their  topsails  shivering;  and  formed  a  double  line,  the 
ships  on  one  being  opposite  the  intervals  of  the  other,  so 
that  they  were  not  crowded,  though  to  our  ships  on  their  beam 
very  little  vacancy  appeared  between  them.  Their  admirals  did 
not  show  their  flags,  but  the  Santissima  Trinidada  being  dis¬ 
tinguished  by  her  four  decks.  Lord  Nelson  ordered  the  Victory 
to  be  steered  for  her  bow... and,  about  half  an  hour  before  the 
enemy  opened  their  fire,  the  signal  was  made  by  telegraph, 
"England  expects  every  man  will  do  his  duty."  (p.  87) 

Thus  began  the  battle  of  Trafalgar.  Because  Nelson's  signal 
to  his  fleet  is  one  of  the  most  famous  telecommunications  in  history, 
it  is  interesting  to  see  what  Nelson’s  signal  officer.  Lieutenant  John 
Pasco,  had  to  say  about  it  (Warner,  1968): 


His  Lordship  came  to  me  on  the  poop,  and  about  a  quarter 
to  noon  said:  "I  wish  to  say  to  the  fleet,  ENGLAND  CONFIDES 
THAT  EVERY  MAN  WILL  DO  HIS  DUTY,"  and  he  added,  "You  must  be 
quick,  for  I  have  one  more  to  make,  which  is  for  close  action." 
I  replied:  "If  your  Lordship  will  permit  me  to  substitute 
expects  for  confides ,  the  sipnal  will  soon  be  completed, 
because  the  word  ’expect’  is  in  the  vocabulary,  and  ’confides’ 
must  be  spelt."  Nelson  replied  in  haste,  and  with  seeming 
satisfaction:  "That  will  do,  Pasco,  make  it  directly." 

(p.  180) 


Although  one  might  argue  that  Nelson's  famous  message  was 
favorably  altered  by  the  device  through  which  he  communicated,  the 
fact  does  remain  that  his  original  message  was  altered.  We  still  have 
such  problems  today.  Whenever  we  use  devices  to  communicate,  the 
devices  themselves  usually  affect  our  communication  in  ways  we  do  not 
yet  completely  understand.  This  is  often  no  mere  inconvenience;  it  is 
a  genuine  problem. 

We  now  have  the  ability,  through  electronic  devices,  to  com¬ 
municate  with  people  anywhere  in  the  world.  Harms  (1974,  chap.  3) 
gives  a  good  description  of  the  "telecomnet "  that  makes  this  possible. 
The  tremendous  technological  achievement  that  the  telecomnet  repre¬ 
sents  may  have  made  many  of  those  associated  with  its  development 


overestimate  its  utility.  For  example,  I.icklider  et  al.  (19t>8)  pre¬ 
dicted  that,  "In  a  few  years,  men  will  be  able  to  communicate  more 
effectively  through  a  machine  than  face  to  face."  Ochsman  and 
Chapanis  (1974)  discuss  this  prediction,  and  state  that  it  has  not  yet 
come  to  pass  because  ot  the  "failure  of  scientists  to  attack  systemati¬ 
cally  the  manv  problems  in  this  complex  (man-machine  communication) 
partnersh  ip. " 

Going  beyond  man-man  communication  via  machines  is  the  fasci¬ 
nating  problem  of  actual  man-machine  dialogue.  Will  it  ever  be  possi¬ 
ble  to  build  computers  that  would  have  all  of  the  beneficial  proper¬ 
ties  of  Arthur  C.  Clarke's  (1968)  fictitious  human-like  HAL?  While  we 
undoubtedly  already  have  the  technological  skills  to  build  the  hard¬ 
ware  for  a  machine  like  HAL,  as  Chapanis  (1971)  pointed  out,  before 
HAL  can  become  a  reality  we  need  a  better  understanding  of  how  people 
themselves  naturally  communicate,  and  we  need  to  know  how  their  com¬ 
munication  is  affected  by  the  devices  through  which  they  converse. 

Horatio  Nelson's  problem  is  still  with  us,  and  only  recently 
have  we  begun  to  understand  its  complex  and  subtle  nature.  In  fact, 
it  is  not  sufficient  to  study  how  people  in  general  communicate,  and  to 
determine  how  their  communication  is  affected  by  telecommunication 
devices.  In  order  to  gain  a  better  understanding  of  the  communication 
process,  it  is  necessary  to  study  different  types  or  groups  of  people 
to  look  for  group-attributable  differences  in  the  way  they  communicate. 
This  means  examining  inter-group  as  well  as  intra-group  dialogue,  and 
seeing  how  the  differences  between  groups  are  affected  by  telecommuni¬ 
cation  devices. 

My  dissertation  experiment  was  a  first  attempt  to  examine  the 
inter-  and  intra-group  dialogue  of  these  two  fundamentally  different 
groups  ot  people:  men  and  women. 

Review  of  the  Literature 

This  review  will  first  provide  a  brief  discussion  of  the 
literature  on  the  human  use  of  telecommunication  systems.  Following 
that,  some  of  the  research  on  the  psychology  of  human  sex  differences 
will  be  examined.  There  is  a  tremendous  amount  of  literature  available 
on  the  psychology  of  sex  differences,  and  this  review  cites  only  those 
references  that  are  relevant  to  the  study  of  sex  differences  as  they 
affect  the  way  people  exchange  factual  information  to  solve  problems. 

Telecommunlcat ion  Research  at  Johns  Hopkins 

Although  the  study  of  human  communication  itself  is  a  major 
subspecialty  within  the  field  of  psychology,  much  of  what  we  now  know 
about  machine-mediated  human  interaction  comes  from  research  done  at 


Johns  Hopkins  under  the  direction  of  Professor  Alphonse  Chapanis. 

Most  of  the  experiments  at  Hopkins  have  examined  two-person  teams  who 
cooperatively  exchanged  factual  information  to  solve  realistic  prob¬ 
lems.  Although  the  complete  criteria  for  these  problems  appear  else¬ 
where  (e.g.,  Chapanis  et  al.,  19  72;  Ochsman  i«  Chapanis,  1974)  ,  tor 
convenience  they  are  summarized  below: 

1.  The  problems  have  sampled  a  wide  range  of  behaviors. 

2.  They  are  representative  of  tasks  for  which  telecommunica¬ 
tions  systems  have  been  used,  or  could  be  used  in  the 

f  uture . 

3.  The  problems  were  not  of  the  abstract  or  artificial  type 
generally  used  to  measure  hypothetical  psychological  pro¬ 
cesses  ,  but  rather  were  the  type  people  might  encounter  in 
everyday  life. 

4.  They  had  definite,  recognizable  solutions  that  could  be 
reached  within  an  hour  or  less. 

5.  They  required  no  special  skills  or  specialized  knowledge 
for  their  solution. 

6.  The  problems  were  structured  so  that  the  two  people  on  the 
team  had  to  communicate  in  order  to  reach  the  solution. 

This  was  accomplished  by  giving  the  two  team  members  com¬ 
plementary  information  folios.  In  general,  one  team  mem¬ 
ber,  the  seeker  ,  is  presented  with  a  problem  he  or  she 
cannot  solve  without  the  help  of  the  source,  who  has  been 
given  all  of  the  information  needed  to  solve  the  problem. 

The  job  role  distinctions  of  seeker  and  source  are  impor¬ 
tant,  and  will  be  used  throughout  this  paper. 

As  an  illustrative  example,  the  "Equipment  Assembly  1'roDlem" 
developed  by  Chapanis  et  al.  (1972)  was  structured  so  that  one  team 
member  (the  seeker)  had  all  of  the  separate  parts  of  a  piece  of  equip¬ 
ment  while  the  other  member  (the  source)  had  the  assembly  instructions. 
The  two  had  to  communicate  because  neither  could  solve  the  problem 
alone.  Another  example  is  the  "Close  Scheduling  Problem"  (Weeks  et  al.. 
1974)  in  which  the  seeker  Is  given  a  list  of  classes  he  must  schedule 
for  himself  so  that  there  are  no  time  conflicts,  doing  so  with  the  help 
of  the  source,  who  has  the  team's  only  list  of  class  meeting  times. 

In  general  it  has  been  shown  that  natural  interactive  human 
communication  is  characterized  bv  extreme  unruliness,  both  grammati¬ 
cally  (Chapanis  et  al.,  1977)  and  in  terms  of  the  number  and  kind  of 
different  words  used  (Michaelis  et  al.,  1977).  However,  people  have 
tremendous  adaptive  capacity,  as  demonstrated  by  Kelly  and  Chapanis 
(1977),  who  limited  the  vocabulary  their  subjects  were  allowed  to  use. 


and  Ford  et  al.  (  1979),  who  limited  the  numbet  ot  words  their  subjects 
could  use.  In  both  these  studies  the  restricted  teams  worked  as  iast 
and  as  accurately  as  did  their  unrestricted  counterparts.  Further, 
although  interruptions  are  a  common  feature  in  normal  human  interuc- 
t  ion ,  Chapanis  and  Overbey  (1974)  have  demonstrated  tliat  they  are  not 
nearly  as  important  as  one  might  suspect. 

How  do  communication  devices  affect  human  interaction?  Five 
different  modes  of  communication  have  been  tested,  either  alone  or  in 
various  combinations:  handwrit ing,  teletypewriting,  telephone,  dose- 
circuit  television,  and,  as  a  control,  face-to-face. 

A  surprising  finding  in  comparing  communication  modes  is  that 
there  is  very  little  difference  between  face-to-face  communication  and 
voice  only  (l.e.,  telephone).  Apparently,  seeing  the  person  vou  are 
conversing  with  adds  little  to  your  ability  to  exchange  factual  infor¬ 
mation  (Chapanis,  1973). 

Interesting  and  highly  significant  differences  appear  when 
contrast ing  oral  modes  of  communicat ion  (voice  and  face-to-face)  with 
"hard-copy"  modes  (.handwriting  and  teletypewriting).  People  communi¬ 
cat  ing  in  oral  modes  are  much  faster  at  solving  their  problems,  yet 
thev  use  many  more  words  (or  tokens),  more  unique  words  (or  types), 
more  sentences,  more  messages,  and  have  lower  type-token  ratios  than 
people  doing  identical  tasks  in  a  hard-copy  mode  (Chapanis  et  al., 

1972;  Chapanis  et  al. .  1977).  In  a  recent  experiment.  Ford  et  al. 

(1979)  demonstrated  that  these  differences  can  be  attributed  to  the 
tact  that  it  is  easier  to  talk  than  to  write  or  type,  and  it  is  the 
basic  inconvenience  of  handwriting  or  typing  that  provides  the  incen¬ 
tive  to  be  concise.  As  an  interesting  sidelight  o  this.  Weeks  et  al. 

(  1979)  compared  the  performance  of  skilled  versus  unskilled  typists  in 
a  teletypewriter  mode  and  found  no  significant  differences  in  their 
performance,  apparently  because  a  relatively  small  percentage  of  one's 
t  Inn-  in  these  experiments  is  actually  devoted  to  sending  messages,  and 
also  because  most  people  compose  messages  as  they  send  them,  thereby 
nullifying  the  advantage  of  being  able  to  type  copy  rapidly. 

There  have  also  been  studies  examining  the  performance  of  vari¬ 
ous  tvpes  or  classes  of  subject.  Parrish  (1974)  found  that  teams  com¬ 
posed  of  college  students  were  faster  than  teams  composed  of  one  col¬ 
lege  student  and  one  high  school  student,  but  these  mixed  teams  were 
faster  than  teams  of  high  school  students.  Regarding  the  mixed  teams, 
those  with  a  college  student  as  the  source  were  faster  than  those  with 
a  high  school  student  in  this  role.  This  suggests  that  the  abilities 
of  the  source  have  a  greater  impact  on  the  team's  performance  than  do 
the  abilities  of  the  seeker. 

Parrish  (1978)  has  since  conducted  a  small  study  in  which  he 
used  a  similar  paradigm  to  compare  incoming  college  freshmen  to  seniors 
at  the  I’nlversltv  of  California,  Santa  Barbara.  Solution  times  did  not 
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Jitter  significantly  between  the  two  groups.  This  suggests  that  tin 
differences  he  found  lit  his  earlier  study  are  not  attributable  to  edu¬ 
cational  level  or  age.  However,  the  differences  might  have  been  duo 
to  intellectual  ability.  Desiring  to  examine  intellectual  ability 
more  closely,  Hoecker  (197'))  tested  high  school  students  to  determini 
the  effects  of  both  verbal  and  spatial  ability  on  their  over-all 
ability  to  use  telecommunications  in  problem  solving.  In  general, 
subjects  who  scored  high  on  the  Differential  Aptitudes  Test  subtest  of 
spatial  ability  did  better  in  his  study  than  those  who  scored  low. 
However,  verbal  ability,  as  determined  by  the  Iowa  Test  of  Basic  Skills 
Reading  Comprehension  Test,  could  only  be  shown  to  affect  performance 
on  a  problem  that  required  the  subjects  to  obtain  their  information 
from  a  written  list. 

One  very  basic  human  difference  that  has  not  been  closely  exam¬ 
ined  by  the  Hopkins  group  is  sex.  All  but  one  of  the  Hopkins  experi¬ 
ments  have  used  subjects  from  just  one  sex,  usually  males.  This  was 
done  partly  out  of  convenience,  as  Hopkins  has  a  predominantly  male 
undergraduate  population,  but  primarily  because  we  have  no  real  know¬ 
ledge  about  the  possible  effects  of  the  sex  of  the  subjects  on  their 
behavior  in  a  factually  oriented  communieat ion  situation.  If  there  are 
effects  on  communication  attributable  to  sex  differences,  simple  random 
assignment  to  experimental  conditions  of  subjects  drawn  from  the  gen¬ 
eral  population  would  have  increased  the  error  variance  in  statistical 
analyses . 

The  experiment  by  Kord  et  al.  (  1  *■) 7 c> was  the  first  at  Hopkins 
to  include  the  variable  of  sex.  Although  the  major  focus  ol  their 
study  was  on  sel f -rest r icted  word  usage,  halt  ot  their  teams  were  com¬ 
posed  of  male  subjects,  and  the  other  half  of  female  subjects.  The 
data  contained  no  statistically  significant  main  effects  attributable 
to  sex  and  onlv  as  many  significant  interactions  of  sex  with  another 
variable  as  would  have  been  expected  by  chance.  However,  the  primary 
focus  of  this  study  was  not  on  differences  between  the  sexes  and,  as  a 
result,  several  independent  and  dependent  variables  relating  to  sex 
differences  were  not  Included  in  the  experimental  design.  For  example, 
there  were  no  mixed-sex  teams.  Also,  there  were  no  clearly  defined  job 
roles  (l.e.,  information  source  and  seeker)  but,  rather,  both  subjects 
in  the  dyad  started  their  session  with  roughly  equal  amounts  ot  infor¬ 
mation.  For  these  reasons,  no  "within  teams"  measures  were  taken  and , 
as  a  result,  anv  possible  interaction  of  sex  with  job  role  could  not 
be  examined. 

At  the  University  of  California,  Santa  Barbara,  Parrish  and 
Michael  is  (submitted  for  publication)  conducted  an  experiment  to 
explore  the  behavior  of  like-  and  mixed-sex  teams  during  cooperative 
problem  solving.  The  performance  of  the  teams  was  assessed  on  time 
required  to  solve  the  problem  and  on  behavioral  measures  ot  activit\ 
such  as  the  type  described  by  Chapanis  et  al.  (ll)72).  The  behavioral 
data  were  obtained  through  direct  observation  of  the  subjects  while 
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they  worked;  every  five  seconds  an  observer  would  note  which  one  of 
the  14  different  previously  defined  behaviors  the  subject  was  engaged 
in.  There  were  eight  different  single  activities:  Sending  Only,  Send¬ 
ing  Pause,  Receiving  Only,  Searching  Only,  Handling  Parts,  Making 
Notes,  Waiting  and  Other;  and  six  dual  categories;  Sending  and  Search¬ 
ing,  Sending  and  Handling  Parts,  Receiving  and  Handling  Parts,  Receiving 
and  Making  Notes,  Searching  and  Handling  Parts,  and  Searching  and  Mak¬ 
ing  Notes.  Their  sampling  method  allowed  Parrish  and  Michaelis  to 
estimate  the  actual  amount  of  time  the  subjects  spent  in  each  behavior 
category . 


Of  the  14  behavioral  categories  sampled,  six  categories 
accounted  for  about  85  percent  of  the  teams'  activity,  on  the  average. 
These  six  categories  were:  Sending  Only,  Receiving  Only,  Searching 
Only,  Handling  Parts,  Making  Notes,  and  Waiting. 

As  was  the  case  with  Ford  et  al.  (1979)  ,  Parrish  and  Michaelis 
found  no  differences  between  their  all-male  and  all-female  teams. 
However,  their  mixed-sex  teams,  on  the  average,  took  over  50  percent 
longer  to  solve  their  problems  than  did  teams  of  like  sex.  Because  of 
the  small  number  of  subjects  tested  and  the  large  amount  of  variability 
in  performance,  however,  this  difference  was  barely  significant  at 
£  <  0.05. 

Parrish  and  Michaelis  tested  two  different  communication  prob¬ 
lems.  When  Parrish  and  Michaelis  designed  these  two  problems,  they  had 
expected  that  males  would  be  superior  on  one  of  them  with  females 
superior  on  the  other.  As  it  turned  out,  in  the  analysis  of  variance 
for  time  to  solution,  the  interaction  of  sex  with  problem  actually 
yielded  an  F  ratio  of  less  than  one,  so  that  there  was  not  even  a  sug¬ 
gestion  of  significance.  Further,  an  analysis  of  the  behavioral  data 
among  groups  of  different  sexual  composition  showed  that,  on  the  aver¬ 
age,  the  different  groups  generally  spent  about  the  same  percentage  of 
time  in  each  activity. 

While  the  main  effect  of  job  role  is  not  of  great  importance 
in  this  type  of  study,  the  interaction  of  job  role  with  sex  gives  an 
indication  of  whether  or  not  males  and  females  do  the  identical  jobs 
differently,  and  also  whether  or  not  their  performance  changes  depend¬ 
ing  on  the  sex  of  their  partner.  This,  in  fact,  is  apparently  what 
happened. 

The  most  striking  interaction  was  for  the  category  of  Search¬ 
ing  Only.  Male  seekers,  no  matter  with  whom  they  were  paired,  averaged 
about  47  percent  of  their  time  in  this  category,  and  female  seekers  on 
like-sex  teams  averaged  about  42  percent,  while  female  seekers  who  had 
been  paired  with  a  male  source  averaged  only  29  percent  of  their  time 
Searching  Only.  However,  these  female  mixed-team  seekers  spent  more 
time  in  the  dual  categories  of  Sending  and  Searching  and  Searching  and 
Making  Notes  than  did  the  other  seekers,  causing  the  interactions  for 
these  two  measures  to  reach  significance  also.  These  two  activities 


combined  account  tor  about  8  percent  ol  the  tomale  mixed-sex  seekers' 
time,  as  compared  witli  about  1  to  2  percent  tor  the  othoi  seekers. 
Parrish  and  Michael  is  otter  no  hypothesis  to  explain  these  findings, 
but  they  do  point  out  that  these  t  einale- seeker  mixed- sex  teams  aveiagcd 
greater  times  to  solution  than  did  teams  in  the  other  groups. 

Another  finding  of  interest  is  that  female  source;,  averaged  a 
greater  proportion  of  their  time  in  dual  activities  than  did  mail 
sources,  and  this  was  true  regardless  ol  the  sex  ot  the  seeker.  Again, 
Parrish  and  Michael  is  declined  to  offer  an  interpretation. 

The  major  reason  Parrish  and  Michaelis  frequently  did  not 
interpret  their  findings  is  because  their  experiment  was  tar  too  lim¬ 
ited  to  allow  them  to  do  so.  Among  these  limit,  u  ions  were: 

1.  The  number  of  subjects  was  quite  small,  as  only  tour  all- 
male,  four  all-female  and  eight  mixed  teams  were  tested, 
with  each  team  tested  only  once.  As  such,  many  possibly 
informative  independent  variables  could  not  be  included  in 
the  design. 

2.  The  subjects  were  unpaid  volunteers.  Unlike  the  Hopkins 
experiments,  this  study  at  Santa  Barbara  provided  no  mone¬ 
tary  incentive  tor  the  subjects  to  be  e  it  tier  fast  or  accu¬ 
rate. 

3.  The  subjects  were  in  adjoining  rooms,  and  communicated  by 
passing  handwritten  notes  to  each  other  through  a  slot  in 
the  wall  that  separated  them.  The  subjects  were  able  to 
hear  each  other  quite  easily,  and  although  they  did  not 
formally  speak  to  each  other  during  the  course  of  the 
experiment,  undoubtedly  such  things  as  laughtei  or  groan¬ 
ing  could  have  biased  the  results,  especially  with  the 

m ixed-sex  t  earns . 

s.  The  subjects  seemed  to  take  their  tasks  much  less  seri¬ 
ously  than  do  Hopkins  subjects.  This  may  have  been  because 
of  the  relatively  impoverished  experimental  conditions,  but 
also  because  the  Santa  Barbara  students,  in  general,  seem 
to  have  less  achievement  mot  ivat  Ion  than  Hopkins  students. 
(This  final  observation  is,  of  course,  purely  subjective.) 

With  these  caveats  In  mind,  it  is  apparent  that  Parrish  and  Michaelis 
left  unanswered  many  questions  about  the  communicative  behavior  of  likc- 
and  mixed-sex  teams  during  cooperative  problem  solving. 

Sex  Differences  in  Cognitive  Styles  and  Abilities 

There  Is  a  great  deal  of  literature  available  concerning  the 
psychological  and  sociological  aspects  ot  human  sex  differences.  Sonic 
of  that  work  has  relevance  to  the  study  ot  sex  differences  in  interac¬ 
tive  problem  solving. 
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On  ttia  other  hand,  tests  at  quant  ltat  ive  amt  spat  tat  ab  i  t  it  v 
far  subjects  aval  the  age  at  In  at  tan  yield  higher  scares  tat  mates 
(sea,  tat  example.  Uteri  at  at.,  19S8;  Raseiiboig  A  Sut  ton -Smith,  19hh, 
l  'he) ;  Stewart,  1  '•  7  (> ;  Vo  r  v  ,  t  '*  (>  7 '  .  1  t  lust  t  a  t  t  vc  data  are  t  lias  a  a  t  Man 

dav  at  al.  (I9bti  b71  who  examined  ACT  math  subscotos  al  <7  18-veai 

aids,  and  t  aumt  that  the  mean  score  at  males  was  0 .  4  7  st  andat  it  devia 
t  tons  above  that  at  the  tamales.  Kurthei  ,  it  lias  tang  been  known  that 
"mathematical  ability"  is  not  a  single  facial  bv  itselt,  but  that  it  is, 
rather,  made  up  at  main  independent  component  ■■  tact ai  analyses  bv 
ltlaekwelt  (  1  940)  and  Very  (I9t»7>,  ls  well  as  athets,  'haw  a  dltteient 

faetar  structure  tat  males  and  temales.  Kai  example,  facial  analyses 

done  on  males  typically  show  the  piesence  at  ana  ai  mate  tactois  t,M 
spatial  ability,  white  analyses  done  on  women  genet  al 1 v  da  not  shew 
these  factors.  As  an  int  crest  lug  sidelight  to  this,  the  Harvard  Pro 
ii’ct  Physics  (Walberg,  1  '*(■>'>  A  a  dm  in  i  s  t  e  red  physics  achievement  tests  to 
hath  male  ami  temale  high  school  physics  students,  and  found  that  the 
males  did  better  than  the  temales  an  test  items  calling  I  at  visual  spa 
till  skills,  while  temale.  did  bettor  an  vet  ha  1  test  items. 

Because  men  and  women  dittot  in  theti  cagntt  ive  styles  and 
ab  1 1 i t  fos,  Mac cob v  and  Tacklin  (1974,  p.  911  t betel  are  assert  that 
they  also  dittot  in  the  wav  they  attack  problems.  It  i s  certainly  not 
Impassible  that  some  sort  at  mismatch  in  pteblom  salving  strategies  was 
involved  when  Patrish  and  Michael  i  s 1  s  mixed  ’-ex  teams  took  '0  percent 
longer  than  like  sex  teams  to  salve  identical  ptablems. 

In  addit  (on  to  looking  at  dltterences  in  mean  scores  between 
the  sexes,  psychologists  have  also  been  Interested  in  ditterences  in 
v.triabll  it  v.  Over  a  halt  century  ago.  Terman  (19.'S)  reported  that  mare 
bays  than  girls  had  l.Q.s  in  excess  at  140,  even  though  the  mean  scares 
I  or  the  two  groups  we  t  e  about  the  same.  fills,  an  the  suit  ace ,  suggests 
that  the  variability  in  scares  is  higher  tat  males  than  tat  temales; 
howovot  ,  Berman's  subjects  wore  not  randomly  selected,  thotobv  making 
Interpretation  at  this  t  tnd 1 ng  d 1 1  t  leu  I  t  . 

Kegarditig  tested  verbal  ability,  mate  iccentlv,  1H  aege  (19n) 
teparted  that  the  seated  for  Is  to  l  7-veat  old  temales  an  the  t.'ATN 
(Verbal)  were  slight  lv  mate  variable  than  those  at  males,  although 
Oates  ( 1  1  1  and  Svensson  (1971)  hath  reported  1  ' -vent  a  1 d  males  to  he 
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more  variable  on  verbal  test  items  than  females  of  the  same  age. 
Apparently,  as  Maccoby  and  Jacklin  ( 1974,  p.  120)  point  out,  despite 
the  generally  higher  verbal  scores  for  females,  the  variability  of 
males  and  females  on  these  tests  is  about  the  same. 

With  regard  to  mathematical  ability,  however,  there  is  strong 
evidence  to  suggest  that  males  are  more  variable  than  females.  Flana¬ 
gan  et  al.  (1961),  Monday  et  al.  (1966-67),  and  Svensson  (1971)  all 
report  that  males  not  only  have  higher  mean  scores  than  females,  but 
also  have  greater  standard  deviations.  This  same  finding  is  also 
reported  by  Keating  (1974),  but  Keating's  subjects  were  196  junior 
high  school  students  who  had  entered  a  science  and  math  contest, 
either  after  having  been  selected  by  their  teachers  or  through  volun¬ 
teering.  Presumably ,  there  was  no  sexual  bias  in  the  selection  of 
these  subjects,  and  even  in  this  carefully  selected  group  the  males 
were  still  more  variable  and  had  a  higher  mean  score  than  the  females. 

To  summarize:  On  tests  of  verbal  ability,  college-age  females 
tend  to  average  scores  slightly  higher  than  those  of  college-age  males, 
although  the  two  groups  exhibit  roughly  equal  variability.  On  tests 
of  mathematical  ability,  the  males  tend  to  do  better  than  do  the 
females  and  also,  as  a  group,  tend  to  be  more  variable.  Factor  analy¬ 
ses  show  that  the  two  sexes  tend  to  use  different  cognitive  skills  for 
solving  problems. 

Sex  Differences  in  Personal ity 

Not  only  are  there  sex  differences  in  cognitive  style  and 
ability,  but  there  are  also  sex-linked  personality  differences. 

Clearly,  there  are  many  such  differences  between  males  and  females,  so 
this  review  will  examine  only  those  that  appear  most  likely  to  affect 
behavior  in  interactive  problem  solving. 

The  aspect  of  personality  that  may  have  the  most  bearing  on 
performance  of  problem-solving  tasks  is  the  basic  one  ol  interest.  It 
is  not  at  all  unreasonable  to  assume  that  people  will  do  better  at 
tasks  they  find  interesting  than  at  those  they  find  dull.  We  must 
therefore  ask  this  question:  Does  the  subject  in  our  experiment  nor¬ 
mally  find  tasks  such  as  the  one  assigned  to  be  of  interest?  Notice 
that  the  question  did  not  ask  if  the  subject  found  the  actual  experi¬ 
mental  task  interesting:  nearly  all  subjects  in  previous  Hopkins  com¬ 
munications  experiments,  males  and  females  alike,  found  participation 
in  the  experiments  both  interesting  and  enjoyable.  Instead,  the  ques¬ 
tion  seeks  to  find  out  if  they  are  normally  interested  in  this  type  ot 
task  because,  if  so,  then  they  are  more  likely  to  have  had  prior 
experience  with  tasks  that  required  similar  cognitive  abilities.  As 
such,  they  might  well  have  some  advantage  in  the  performance  ot  the 
experimental  task,  or,  at  the  very  least,  we  might  expect  them  to  use 
a  different  strategy  than  someone  with  little  prior  experience. 
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(Vs  would  bo  expected,  males  and  females  have,  on  the  average, 
different  Interests.  Monday  ot  al.  (1966-67)  report  that  males  tend 
to  have  more  interest  in  technologically  oriented  fields,  such  as 
engineering,  while  females  tend  to  have  a  greater  interest  in  social, 
religious,  and  educational  fields. 

Holland's  (1965)  Vocational  Preference  Inventory  has  been  used 
by  Ast  in  (1°?'*)  and  Kox  et  al.  (1976)  to  examine  tin  vocational  inter¬ 
ests  of  youngsters  who  had  been  Identified  as  being  mathematically  and 
sclent ii  ically  precocious.  The  Vl’l  has  six  categories  of  vocational 
preferences:  Realistic,  Investigative,  Enterprising,  Artistic,  Social, 

and  Conventional.  The  males,  in  general,  tended  to  opt  for  vocations 
that  were  classed  as  Investigative  or  Realistic,  while  the  females 
opted  for  Social  or  Artistic  careers.  One  might  reasonably  expect 
that  people  who  are  "realistic"  and  "investigative"  might  be  more 
naturally  oriented  toward  problem-solving  tasks. 

As  it  turns  out,  in  actual  tasks  Involving  problem  solving, 
males,  in  general,  seem  more  confident  about  their  abilities  than 
females  (Carey,  ll)58;  Crandall,  1969;  Erl  ick  Le  Bold,  1976;  Feather, 
1968;  Rychlak  6  Center,  1965)  and  claim  to  be  more  sure  of  themselves 
in  new  situations  (Schwartz  Clausen,  1970).  Further,  Deaux  (  1976) 
has  reported  that  males  are  more  likely  than  females  to  attribute  suc¬ 
cess  at  <i  task  tit  skill;  the  females  tend  to  attribute  their  success  to 
luck. 


In  general,  we  have  seen  so  far  that  males  and  females  differ 
not  only  in  cognitive  areas,  but  also  in  their  interests  and  attitudes 
toward  problem  solving. 

Sex  Differences  in  Interpersonal  Relationships 

Cognitive  patterns,  interests,  and  levels  ot  self-confidence 
are  all  intrapersonal  attributes  that  could  affect  problem-solving 
ability.  In  addition,  there  are  sex-linked  Interpersonal  attributes 
that  could  affect  performance  in  a  situation  requiring  two  people  to 
work  together  to  solve  a  problem. 

ihte  of  the  most  widely  recognized  of  those  sex-linked  attri¬ 
butes  is  that  of  aggression.  Studies  using  questionnaires  and  stan¬ 
dardized  tests  have  found  that  males  tend  to  be  more  aggressive  and 
less  patient  than  female:;  (Barclay,  1970;  Youssef,  1968).  This  finding 
is  confirmed  by  experimental  studies  (Doob  N  Gross,  1968;  Shorter  et 
al.,  1966;  Youssef,  1968).  In  light  of  these  fittdings,  it  is  not  sur¬ 
prising  that  females  tend  to  "take  orders"  better  than  males  (Maccoby 
A  Jackl  in,  1974,  P.  371) . 

Mi> s t  of  these  studies,  however,  were  done  before  "feminism" 
became  popular,  so  we  need  to  ask  if  the  female  of  today  would  st  ill 
be  less  aggressive  than  males.  Despite  the  studies  suggest ing  that 
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sex-linked  differences  in  personality  may  be  attributed  o  differences 
in  upbringing,  there  is  strong  evidence  that  differences  in  aggressive 
behavior  are  largely  biological.  It  has  been  frequently  demonstrated 
that  the  level  of  aggression,  in  both  humans  and  primates,  is  directly 
correlated  with  the  level  of  androgens  in  the  blood,  and  that  experi¬ 
mentally  varying  the  level  of  androgens  in  an  individual  has  a  direct 
effect  on  his  or  her  aggressive  behavior  (Barfield,  1976;  Ehrhardt  6 
Baker,  1973;  Money  6  Ehrhardt,  1972).  As  such,  we  may  expect  that  the 
previously  noted  sex  differences  in  aggressive  behavior  are  still 
val  id . 


The  behavior  of  like-  and  mixed-sex  dyads  lias  also  been  studied, 
but  this  has  generally  been  through  the  use  of  tasks  and  situations  of 
little  apparent  relevance  to  studies  that  required  people  to  cooperate 
in  exchanging  factual  information.  As  an  example,  McNeal,  McClintock 
and  Nutt  in  (  1972)  examined  the  behavior  of  like-  and  mixed-sex  pairs  in 
the  Prisoner's  Dilemma  game,  and  found  that  teams  oi  mixeo  sex  were 
less  competitive  than  those  oi  Like  sex.  Bedell  and  Sistrunk  (1973) 
also  studied  the  Prisoner's  Dilemma,  and  reported  that  the  male-male 
and  male-female  pairs  both  cooperated  more  often  than  did  the  female- 
female  pairs.  Rapoport  and  Chammah  (1965)  reported  that  all-male 
pairs  were  more  cooperative  than  mixed-sex  or  all-female  pairs,  while 
Tedesehi,  lesnick  and  Cahagan  (  1968)  reported  females  to  be  more  cooper¬ 
ative  than  males.  Although  the  four  reports  presented  above  disagree 
in  their  findings,  an  examination  of  the  bulk  of  the  studies  done  on 
this  topic  suggests  that  males  are,  in  general,  more  cooperative  than 
females  in  this  type  of  situation.  It  should  be  noted  in  this  regard 
that  some  researchers  feel  that  males  are  only  physically  more  agres- 
sive,  while  females  are  aggressive  in  more  subtle  ways  (e.g.,  "catti¬ 
ness")  that  could  affect  how  well  and  how  easily  they  are  able  to 
cooperate  with  others  (see,  for  example,  Feshbach,  1970;  McIntyre, 

1972).  The  findings  on  this  matter,  however,  are  far  from  conclusive. 

On  the  other  hand,  a  series  of  studies  reported  by  Rosenthal 
(1976)  does  bear  directly  on  the  question  oi  sex  effects  in  dyads  when 
one  person  is  giving  instructions  to  another.  Rosenthal's  primary 
interest  is  in  experimenter-subject  interactions,  but  his  finding  could 
easily  extend  to  the  source-seeker  interactions  of  our  communication 
experiments.  He  reported  that  experimental  sessions  generally  last 
longer  when  the  experimenter  and  subject  are  of  the  opposite  sex,  as 
compared  to  when  they  are  of  the  same  sex.  This  was  despite  the  fact 
that  the  experimenters,  in  all  cases,  were  reading  from  identical 
instruction  sheets,  and  were  therefore  giving  verbally  identical 
instructions.  Further,  in  a  personal  communication,  Rosenthal  reported 
that  even  in  the  portion  of  the  sessions  where  the  experimenter  was 
simply  reading  the  Instructions  to  the  subject,  mixed-sex  experimenter- 
subject  dyads  more  often  than  not  took  longer  than  like-sex  dyads. 


With  regard  to  the  study  of  telecommunication  systems  at  Johns 
Hopkins,  it  has  been  demonstrated  that  when  the  devices  are  substituted 
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for  face-to-face  communication,  these  devices  can  affect  the  way  that 
people  "naturally"  behave. 

With  regard  to  the  study  of  human  sex  differences,  it  has  been 
demonstrated  that  men  and  women  differ  in  their  cognitive  styles, 
abilities,  and  strategies,  in  their  likes,  dislikes,  and  attitudes 
toward  problem-solving  situations,  and  in  the  way  they  interact  with 
other  people. 

In  his  study  of  the  ways  in  which  an  experimenter  can  inadver¬ 
tently  bias  the  results  of  an  experiment,  Rosenthal  found  that  subjects 
generally  worked  faster  if  they  had  received  their  instructions  from  an 
experimenter  of  the  same  sex,  even  though,  in  all  cases,  the  experi¬ 
menters  had  been  reading  to  the  subjects  from  identical  instruction 
sheets.  Apparently,  the  people  who  had  participated  in  Rosenthal's 
study  behaved  differently  depending  on  the  sex  of  the  person  with  whom 
they  had  been  paired. 

In  a  study  by  Parrish  and  Michaelis  which  examined  the  behavior 
of  two-person  teams  during  cooperative  problem  solving,  it  was  found 
that,  on  the  average,  mixed-sex  teams  took  over  50%  longer  than  like- 
sex  teams  to  solve  identical  problems.  This  may  simply  be  just  one 
more  example  of  subjects  behaving  differently  because  of  the  sex  of 
their  partner.  However,  the  possibility  exists  that  this  finding  could 
also  be  due,  at  least  in  part,  to  some  sort  of  mismatch  between  men  and 
women  in  the  way  they  communicate  to  solve  problems. 

There  is  certainly  ample  evidence  in  the  literature  on  sex  dif¬ 
ferences  to  support  the  "mismatch  hypothesis."  If  a  mismatch  of  this 
type  does  exist,  then  it  will  not  only  have  theoretical  implications, 
but  a  practical  one  as  well:  the  envisioned  natural  language  computers 
of  the  future,  if  they  are  to  be  completely  human-factored,  may  have  to 
be  strictly  neuter,  or  perhaps  even  adjust  themselves  to  the  sex  of 
their  human  user. 

On  the  other  hand,  the  subjects  tested  by  Rosenthal  and  by  Par¬ 
rish  and  Michaelis  were  normal  college-age  men  and  women.  Perhaps  the 
mixed-sex  pairs  did  not  do  as  well  because  they  had  more  on  their  minds 
than  the  assigned  experimental  tasks.  While  there  is  little  or  no  evi¬ 
dence  of  social  interaction  in  the  verbal  communication  of  these  sub¬ 
jects,  the  experimenters  made  no  attempt  to  control  the  possibly  con¬ 
founding  effects  of  non-verbal  communication.  In  fact,  Rosenthal 
attributes  his  findings  to  non-verbal  communication  between  his  sub¬ 
jects,  and  this  might  well  account  for  the  findings  of  Parrish  and 
Michaelis. 


Purpose 

The  literature  review  cited  many  studies  in  which  males  and 
females  were  demonstrated  to  be  significantly  different  on  several 
psychological  criteria.  The  purpose  of  my  dissertation  is  to  find  out 
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whether  males  and  females  also  differ  in  any  practical  way  in  how  they 
exchange  information  over  a  telecommunication  system.  In  particular, 
it  seeks  to  discover  differences  that  should  be  accounted  for  in  the 
design  of  a  natural  language  computer. 

In  order  to  accomplish  this  goal,  my  dissertation  experiment 
was  designed  to  answer  the  following  specific  questions:  When  two 
people  of  either  sex  solve  a  problem  by  communicating  over  a  teletype¬ 
writer  system,  what  differences,  if  any,  would  there  be  between  the 
way  the  males  and  females  communicated?  Further,  what  differences 
would  there  be  between  the  like-  and  mixed-sex  pairs  of  communicators? 
What  effect  would  it  have,  both  in  general  and  on  the  above  differ¬ 
ences,  if  these  people  communicated  without  knowing  each  other's  sex? 
And,  finally,  what  would  happen  to  the  communication  if  the  communi¬ 
cators  are  given  an  incentive  to  be  as  concise  as  possible?  The 
answers  to  these  questions  should  provide  an  indication  of  whether 
there  are  any  practical  differences  in  the  way  men  and  women  communi¬ 
cate  to  solve  problems. 


Method 


Subjects 


The  subjects  were  96  undergraduate  students  from  the  Johns  Hop¬ 
kins  University,  A8  males  and  48  females.  They  were  recruited  through 
posters  advertising  an  "interesting  telecommunications  experiment." 

Experimental  Facilities 


The  experimental  facilities  consisted  of  three  major  and  fairly 
distinct  items:  the  laboratory,  the  telecommunication  system,  and  the 
auxiliary  equipment. 

The  laboratory.  This  experiment  was  conducted  in  the  telecom¬ 
munication  laboratory  at  Johns  Hopkins.  A  diagram  of  the  laboratory 
is  shown  in  Figure  1.  This  experiment,  like  many  others  done  here, 
assigned  subjects  to  the  job  roles  of  source  and  seeker.  For  all  ses¬ 
sions,  sources  were  assigned  to  room  A  and  seekers  to  room  C;  rooms  B 
and  D  were  not  used. 

Each  subject's  room  contained  a  chair,  a  worktable,  and  equip¬ 
ment  for  several  different  telecommunications  media,  although  only  the 
teletypewriters  were  used  in  this  study.  I  stayed  in  the  observation 
room  during  the  experimental  sessions,  and  was  able  to  observe  the 
subjects  through  one-way  mirrors. 

Telecommunication  equipment.  The  subjects  communicated  through 
two  interconnected  input-output  writers  (IBM  Model  It 735).  These  input- 
output  writers  closely  resemble  regular  electric  typewriters,  and  have 
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Figure  l.  The  t o I ecommun  lent  ion  laboratory  at  Johns  Hopkins. 
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standard  typewriter  keyboards,  except  for  one  extra  key  which  is 
labeled  "SEND." 

This  teletypewriter  system  was  restricted  to  half-duplex  oper¬ 
ation.  This  means  that  only  one  subject  could  transmit  at  any  given 
time.  When  a  subject  wanted  to  send  a  message,  he  had  to  first  obtain 
control  of  the  system  by  pressing  the  send  key  of  his  input-output 
writer.  This  freed  the  keyboard  of  his  writer  and  put  it  in  the  "trans¬ 
mit"  mode,  while  simultaneously  locking  the  keyboard  of  his  partner's 
writer  by  putting  it  in  the  "receive"  mode.  Whatever  a  subject  typed 
appeared  on  his  own  terminal  in  red  and  on  his  partner's  in  black, 
thereby  identifying  the  author  of  each  message.  Control  of  the  system 
was  confirmed  for  the  subjects  by  a  green  pilot  light  on  a  small  metal 
box  mounted  just  above  the  input-output  writer.  A  subject  could  keep 
control  of  the  system  until  his  partner  pressed  the  send  key  on  the 
other  input-output  writer.  This  feature  allowed  subjects  to  interrupt 
each  other  at  any  time. 

Auxiliary  equipment.  The  subjects  received  their  instructions 
from  a  prerecorded  tape  played  over  a  Sony  Model  TC-353  stereo  tape 
recorder  which  was  equipped  with  two  detachable  speakers.  The  tape 
recorder  itself  was  kept  in  the  observation  room,  and  one  speaker  was 
placed  in  each  of  the  two  subject  rooms. 

An  SGL  Waber  Model  24  switchable  power  output  box  was  mounted 
near  the  input-output  writer  in  room  C,  the  seeker's  room.  The  power 
for  both  input-output  writers  was  routed  through  this  box,  as  was  the 
power  for  a  60-minute  electric  timer  (Standard  Electric  Time  Corpora¬ 
tion  Model  S-60-ER) .  A  discussion  of  how  this  system  worked,  and  the 
rationale  behind  it,  is  provided  in  the  Procedure  section. 


Problem-solving  Task 

The  source  was  given  a  completely  assembled  Tinkertoy^^ model , 
and  the  seeker  was  given  the  parts  from  which  he  was  required  to  build 
an  identical  model.  A  photograph  of  the  model  is  shown  in  Figure  2. 


The  model  was  made  of  ten  wooden  dowel  rods,  color  coded 
according  to  length,  and  eight  wooden  wheel-shaped  connectors.  All 
rods  were  one  quarter  inch  in  diameter;  there  were  three  green  rods 
(7.5"  in  length),  three  blue  rods  (3.25"  in  length),  and  four  yellow 
rods  (2.125"  in  length).  The  connecting  pieces  had  a  diameter  of  1.25" 
and  a  depth  of  .625".  Six  "SP"  type  connectors  were  used,  these  having 
one  quarter- inch  hole  through  their  centers  and  eight  quarter- inch 
holes  around  their  circumferences.  The  rods  fit  snugly  into  these 
holes.  Two  "W"  type  connectors  were  also  used,  these  having  a  three- 
eighths  (rather  than  one-quarter)  inch  hole  through  their  centers  , 
four  quarter-inch  holes  around  the  center  hole,  and  four  quarter-inch 
holes  around  their  circumferences. 


Experimental  Des ign 


In  all,  48  two-person  teams  were  tested  in  each  of  16  differ¬ 
ent  factor-level  combinations  (Table  L) .  There  was  one  within-teams 
factor:  job  roles;  and  four  between-teams  factors:  sex  of  seeker,  sex 
of  source,  unlimited  versus  self-limited  word  usage,  and  introduced 
versus  anonymous  team  members.  Each  factor  had  two  levels. 

There  were  two  forms  of  the  analysis  of  variance  (Table  2). 

Form  1  was  used  for  analyzing  measures  of  individual  performance  (e.g., 
number  of  words),  and  Form  2  was  used  for  measures  of  team  performance 
(e.g.,  time  to  solution). 

Independent  Var iab les 

The  measure  of  job  roles  compares  the  performance  of  the  two 
team  members,  the  source  and  seeker.  As  has  already  been  stated,  the 
source  was  given  a  completely  assembled  model,  and  was  required  to 
assist  the  seeker,  who  had  been  given  the  separate  parts  from  which  he 
or  she  was  required  to  build  an  identical  model. 

The  factors  ol  sex  of  seeker  and  sex  of  source  are  self- 
explanatory.  Because  these  factors  were  crossed  in  the  design,  there 
were  12  all-male  teams,  12  all-female,  12  mixed-sex  with  a  male  seeker, 
and  12  mixed-sex  with  a  female  seeker. 

The  factor  of  unlimited  versus  self-limited  word  usage,  hence¬ 
forth  referred  to  simply  as  the  word  usage  factor,  was  quite  similar 
to  the  one  used  by  Ford  et  al.  (1979).  Exactly  what  constitutes  a 
word  is  explained  in  the  Dependent  Measures  section,  but,  in  general,  1 
counted  as  a  word  any  string  of  one  or  more  typewriter  characters  sepa¬ 
rated  from  other  strings  by  at  least  one  blank  space.  The  word  usage 
factor  involved  allowing  half  of  the  teams  to  use  as  many  words  as  they 
liked  in  solving  the  problem,  while  penalizing  the  other  teams  for  each 
word  they  used.  This  was  accomplished  by  the  method  I  used  for  paying 
the  subjects.  All  subjects  received  a  base  pay  of  $2.00  for  partici¬ 
pating  in  the  experiment,  plus  a  bonus  for  correctly  solving  the  prob¬ 
lem.  The  subjects  on  the  unlimited  teams  received  a  fixed  bonus  of 
$1.50,  while  the  subjects  on  the  self -1  united  teams  received  a  bonus  of 
$3.25  minus  one  cent  for  each  word  that  their  team  used.  The  average 
bonus  awarded  to  self-limited  subjects  turned  out  to  be  slightly  less 
than  $2.00,  and  so  was  nearly  the  same  as  the  $1.50  bonus  received  by 
the  unlimited  subjects. 

Finally,  there  was  the  factor  of  introduced  versus  anonymous 
teams,  henceforth  referred  to  as  the  anonymity  factor.  For  halt  of  the 
teams  in  the  experiment,  the  team  members  were  allowed  to  see  each 
other  briefly  at  the  beginning  of  the  session.  I  introduced  them  bv 
saying,  "This  will  be  vour  partner  for  the  experiment,"  and  then  1  sep¬ 
arated  them  before  they  had  a  chance  to  speak  to  each  other.  The 
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Table  1 

The  Experimental  Design 


Male  Seeker 

Female 

Seeker 

Problem 

Var  tables 

Male  Source  Female  Source 

Male  Source 

Female  Source  1 

Introduced 

Unlimited 
word  usage 

T  T  T 

1  ’  2  3 

■« 

Self-limited 
word  usage 

Anonymous 

Util  imited 
word  usage 

Sel f-1 imited 
word  usage 

' 

t 

i 

T  T  T 

46’  47’  48 

At  tho  University  of  California,  Santa  Barbara,  Parrish  anJ 
Michael  Is  (submitted  for  publication)  conducted  an  experiment  to 
explore  the  behavior  of  like-  and  mixed-sex  teams  during  cooperative 
problem  solving.  The  performance  of  the  teams  was  assessed  on  time 
required  to  solve  the  problem  and  on  behavioral  measure  ot  activity 
such  as  the  type  described  by  Chapanis  et  al.  (1972).  lhe  behavioral 
data  were  obtained  through  direct  observation  of  the  subjects  while 
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Table  2 

The  Two  Forms  of  the  Analysis  of  Variance,  Showing  Sources  of 
Variation  and  tlegrees  of  Freedom  Associated  With  Each 
Source  of  Variation 


Source  of  Variation 

- . - ■ — 

Form  1 
d.f  . 

Form  2 
d.f. 

Sex  of  Seeker  (SK) 

1 

1 

Sex  of  Source  (SO) 

1 

1 

Introduced  vs.  Anonymous  (A) 

1 

1 

Unlimited  vs.  Seif-limited  Word  Usage  (W) 

1 

1 

SK  x  SO 

1 

1 

SK  x  a 

1 

1 

SK  x  w 

l 

1 

SO  x  A 

1 

1 

SO  x  w 

1 

1 

A  x  w 

1 

1 

SK  x  SO  x  a 

1 

1 

SK  x  SO  x  w 

1 

1 

SK  x  A  x  H 

1 

1 

SO  x  A  x  w 

1 

1 

SK  x  SO  x  A  x  W 

1 

1 

Teams / (SK-SO-A-W)  combos. 

32 

32 

Job  Roles  (JR) 

1 

- 

JR  x  SK 

1 

_ 

JR  x  SO 

1 

_ 

JR  x  a 

1 

_ 

JR  x  w 

1 

_ 

JR  x  SK  x  SO 

1 

_ 

JR  x  SK  x  a 

1 

JR  x  SK  x  w 

1 

_ 

JR  x  SO  x  a 

1 

_ 

JR  x  so  x  w 

1 

_ 

JR  x  a  x  w 

1 

_ 

JR  x  SK  x  SO  x  a 

1 

_ 

JR  x  SK  x  so  x  w 

1 

_ 

JK  x  SK  x  a  x  W 

1 

JR  x  SO  x  A  x  W 

1 

_ 

JR  x  SK  x  SO  x  A  x  W 

1 

- 

JR  x  Teams/ (SK-SO-A-W)  combos. 

32 

- 

TOTAL 


95 


47 
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remaining  loams  wore  anonymous,  and  the  loam  members  did  not  meet  each 
other  until  after  the  session  was  over.  During  the  course  of  the  ses¬ 
sion  they  knew  nothing  ot  each  other  save  what  they  could  deduce  from 
each  other's  typewritten  messages. 

1'ependen  t  Measures 

There  wore  four  classes  of  dependent  measure:  (1)  the  t  ime 
required  to  solve  the  problem,  (2)  various  measures  ot  verbal  produc- 
t  ivity,  (1)  experimenter  tabulat  Ions  of  various  types  of  subject  to- 
subject  information  requests,  and  (4)  responses  to  a  questionnaire 
completed  by  the  subjects  at  the  conclusion  of  their  session. 

Time  to  solution.  This  is  actually  a  measure  of  the  total 
amount  of  time  the  communication  system  was  open.  When  !  turned  on  the 
system  power  at  the  start  ot  the  session,  thereby  allowing  the  subjects 
to  communicate,  an  electric  timer  also  received  power.  When  the  seeker 
was  satisfied  that  the  model  was  properly  assembled,  he  turned  the  sys¬ 
tem  ot t  and  thereby  stopped  the  timer.  Since  this  total  time  to  solu- 
t ion  is  a  measure  of  team  performance.  Form  2  of  the  analysis  ot  vari¬ 
ance  (Table  2)  was  used  for  analyzing  these  data. 

Number  of  words.  While  it  may  at  first  appear  to  be  a  simple 
task,  it  is  actually  quite  difficult  to  determine  how  to  count  words, 
even  in  a  teletypewriter  mode.  There  is  no  agreement  on  this  even 
among  the  Hopkins  telecommunications  group.  The  word  counting  rules 
used  by  Michael is  et  al.  (1977)  are  different  from  those  used  by  Kelly 
and  Chapanis  (1977),  and  these  are  different  from  those  used  by  Ford  et 
al.  (1978).  In  general,  rules  tor  word  counting  have  been  established 
and  modified  according  to  the  context  in  which  they  will  be  used. 

For  this  study,  1  usually  counted  as  a  word  any  string  ot  one 
or  more  typewriter  characters  surrounded  by  spaces.  Here  are  the  spe¬ 
cific  rules  1  used: 

1.  In  general,  any  string  of  letters,  whether  they  spell  any¬ 
thing  or  not,  counted  as  a  word.  For  example,  "word," 
"wodr,"  and  "owdr"  would  each  count  as  one  word.  Single 
letters  by  themselves,  such  as  "A,"  "B,"  or  "C"  would  be 
counted  as  words. 

2.  Numbers  count  as  words.  For  example,  "3,"  "25,"  and  "1/2" 
would  each  count  as  a  word. 

3.  Special  typewriter  symbols,  such  as  "% , "  or  "+"  each 

count  as  individual  words. 

4.  Punctuation  marks  not  used  solely  for  punctuation  were 

counted  as  words.  For  example,  "?????"  and  " . "  each 

count  as  individual  words. 


5.  Standard  contractions,  such  as  "can't"  or  "I'm"  each  count 
as  one  word. 

6.  Words  run  together  or  erroneously  spelled  were  counted  as 
their  apparent  constituent  words.  For  example,  "erron 
eously  spaced"  and  "runtogethe r"  would  each  count  as  two 
words. 

7.  Strikeovers  count  as  words.  For  example,  1"  would  be 

counted  as  one  word:  1^"  as  two. 

The  number  of  words  used  by  each  subject  was  counted  indepen¬ 
dently  by  a  research  assistant  and  by  me.  Disagreements  were  rare,  and 
were  resolved  to  yield  the  values  used  in  the  analyses. 

Messages .  A  message  began  when  a  subject  obtained  control  of 
the  communication  system  and  started  typing.  The  message  ended  when 
the  subject's  partner  obtained  control  and  typed  something  in  return. 

It  did  not  count  as  a  message  if  a  subject  merely  obtained  control  of 
the  system  and  then  lost  it  before  typing  anything.  Messages  had  to  be 
at  least  one  typewriter  character  (i.e. ,  one  word)  in  length. 

Although  the  source  and  seeker  were  taking  turns  sending  mes¬ 
sages,  the  number  of  messages  sent  by  one  does  not  have  to  equal  the 
number  sent  by  the  other.  If  the  subject  who  sent  the  first  message 
is  also  the  subject  who  sent  the  last,  then  he  will  have  sent  one  more 
message  than  his  partner.  For  this  reason,  Form  1  rather  than  Form  2 
of  the  analysis  of  variance  (Table  2)  was  used  for  this  measure. 

Mean  message  length.  The  mean  message  length  is  a  derived 
measure,  obtained  by  taking  the  total  number  of  words  used  by  a  subject 
and  dividing  by  the  total  number  of  messages  generated  by  that  subject. 

Communication  rate.  Communication  rate  is  another  derived 
measure,  obtained  by  taking  the  total  number  of  words  used  by  a  subject 
and  dividing  by  his  team's  time  to  solution.  Notice  that  this  is  not  a 
pure  measure  of  communication  rate,  in  that  it  is  based  on  overall  time 
to  solution  and  not  the  amount  of  time  actually  spent  communicating. 

Requests  for  information.  The  five  measures  just  described 
have  the  advantage  of  being  quantitative.  However,  they  only  measure 
how  much  was  said  and  how  fast.  They  do  not  give  us  an  idea  of  how 
well  the  subjects  understood  each  other,  a  factor  which  gets  at  the 
essence  of  this  study.  I  decided  that  the  best  way  to  judge  this  was 
to  examine  the  record  of  each  subject's  communication  with  his  partner 
to  see  how  many  and  what  kinds  of  questions  were  asked. 

1  should  stress  at  this  point  that  1  was  not  interested  solely 
in  questions,  at  least  according  to  the  grammatical  sense  of  that 
word.  My  interest  lay  in  what  I  refer  to  as  requests  for  information. 


This  is  because  the  question,  "Where  does  this  piece  go?"  is,  for  my 
purposes,  functionally  the  same  as  the  simple  imperative  statement, 
"Tell  me  where  this  piece  goes." 

It  proved  to  be  extremely  difficult  to  develop  a  reliable, 
unambiguous  coding  scheme  to  rate  the  various  types  of  Information 
requests.  This  was  primarily  due  to  the  remarkably  unstructured  man¬ 
ner  in  which  the  subjects  communicated.  (This  aspect  of  natural  human 
communication  is  discussed  in  detail  by  Chapanis  et  al . ,  1977,  and 
Michaelis  et  al.,  1977.)  After  much  trfal  and  error,  and  with  a  great 
deal  of  help  from  my  research  assistant,  the  following  coding  scheme 
was  developed: 

1.  Requests  for  New  Information.  (e.g.,  "What  do  I  do 
next  ?") 

2.  Requests  for  Clarification  of  Previously  Given  Information, 
(e.g.,  "Could  you  go  over  that  last  part  again?' or  "Now  I'm 
completely  lost.") 

3.  Requests  for  Confirmation  of  Own  Understanding  or  Status, 
(e.g.,  "Should  those  pieces  form  a  triangle  when  1  put 
them  together?") 

4.  Requests  for  Confirmation  of  Partner's  Understanding  or 
Status.  (e.g.,  "Do  you  understand?"  or  "Have  you  assembled 
the  base  of  the  model  yet?") 

5.  Other.  (e.g.,  "What  time  is  It?") 

The  original  typewritten  protocols  from  all  48  experimental 
sessions  were  coded  by  my  research  assistant.  1  did  not  code  them  my¬ 
self  because  of  the  possibility  that  my  expectations  could  affect  my 
judgment.  However,  to  verify  the  reliability  of  the  coding  scheme,  1 
independently  coded  five  randomly  chosen  protocols,  and  came  up  with 
virtually  the  same  tabulations  as  did  my  research  assistant. 

(Questionnaire  data.  All  subjects  were  asked  to  complete  briet 
questionnaires  at  the  end  of  their  sessions.  The  questionnaire  had 
been  given  to  them  inside  a  sealed  envelope  before  the  start  ot  the 
session.  The  seeker  was  instructed  to  open  the  envelope  and  work  on 
the  questionnaire  right  after  turning  ot f  the  system;  the  source  was 
instructed  to  do  the  same  after  hearing  the  power  go  off  In  his  input- 
output  writer. 

The  subjects  on  the  introduced  teams  received  the  one-page 
questionnaire  reproduced  in  Figure  1.  For  the  subsequent  analysis, 
t he  answers  they  gave  to  each  of  the  five  quest  ions  were  converted  to 
numerical  values,  one  for  the  box  on  the  far  left  to  seven  for  the  far 
right,  for  the  seven  possible  answers  on  the  rating  scales. 


This  questionnaire  is,  of  course,  voluntary  and  strictly  anonymous. 

DO  NOT  SIGN  YOUR  NAME.  The  five  questions  below  simply  ask  you  to  rate 
your  own  and  your  partner's  performance.  Please  check  the  appropriate  box 

YOUR  OWN  MESSAGES: 

1)  Rate  the  CLARITY  of  your  own  messages  to  vour  partner: 

Unclear  and  I  I  I  I  I  I  j  J  j  1  I  |  I  j  Clear  and 

Obtuse  j _ J  | _ j  | _ j  j _ j  ( _ j  J _ j  j _ J  Understandable 


2)  Rate  the  BREVITY  of  your  own  messages  to  your  partner: 

“srlly  □□□□□□□  Jr.  »r 


YOUR  PARTNER'S  MESSAGES 

3)  Rate  the  CLARITY  of  your  partner's  messages  to  you: 

Unclear  and  [  ~j  I  j  I  )  I  j  I  J  J"  “J  T ~  Clear  and 

Obtuse  ( _ j  | _ _J  j _ J  ( _ |  ( _ |  j _ (  | _ j  Understandable 


4)  Rate  the  BREVITY  of  your  partner's  messages  to  you: 

sr'r11’  □□□□□□□  sir 


5)  Rate  how  well  you  and  your  partner  were  able  to  work  together: 

The  two  of  us  The  two  o 

made  a  terri¬ 
ble  team. 


□  □□□□□□  =2.—' 


Figure  3. 


The  first  page  of  the  questionnaire. 


The  subjects  Oil  t  lie  anonymous  teams  received  .1  (wo  page  quo*, 
t  ionna  l  re .  Tlio  t  irst  page  was  the  same  as  t  lie  quest  iontiaiic  given  1 1> 
t  be  lilt  reduced  teams  (Figure  1)  .  Hie  second  page  Is  shown  In  Figure  «. 
1  was  interested  m  t  he  i  t  answers  to  the  second  .|nest  ion  on  this  pace, 
rhis  1;.  because  the  majot  focus  ot  mv  studv  i  s  on  -ex  cttccts,  and.  In 
Interpret  itig  my  t  indiums,  I  need  to  know  It  m\  anonvuious  teams  i>  allv 
were  anonymous  with  regard  to  sex.  Note  that  this  question  is  actuallx 
composed  ot  throe  distinct  parts:  ( O  What  is  vour  partner’s  os' 

(.')  How  sure  are  you?  and  i.31  What  Is  the  basis  tot  vein  decision'.’ 

Ibis  allows  me  not  only  to  judge  the  cited  ivoncss  ot  mv  anonvmltx 
condition,  but  also  provides  me  an  Insight  Into  the  wax  people  t  o  nn 
Judgments  o t  others  when  thee  .ire  common icat lug  over  a  tolotvpowt  itei 
svst  em. 

l'he  introduction  to  this  page  o  t  the  quest  lonnaire,  as  well  as 
ijuest  ix'its  1,  1,  and  ■! ,  hail  as  theii  primary  purpose  to  ptevent  the  sub¬ 

jects  irom  realizing  that  mine  was  a  studv  ot  sex  ditteieiiees,  1  did 
not  analyze  the  answers  to  these  quest  ions ,  except  as  they  related  to 
t  he  sex  q mss  t  1  on  . 

1'  root* dll  t  e 

l he  subjects  wote  solicited  by  postcis,  put  up  at  vai  ions  Icc.i 
t  ions  on  the  .Johns  Hopkins  Ihitversltv  campus.  The  postets  directed 
interested  students  to  a  sign-up  sheet  which  had  been  posted  on  tin- 
door  t  o  my  i>  t  t  ice.  l'he  sheet  asked  them  tot  then  names  and  telephone 
numbers . 

Scheduling  ot  subjects.  Kverv  Sunday  evening,  dui  ing  the  eom-.e 
ot  tin*  experiment,  1  randomly  selected  tour  teen  to  twenty  potential 
subjects  from  the  slgn-up  list  and  telephoned  them.  I  described  mv 
experiment  a;;  involving  two-person  teams  common  i  eat  i  tig  via  teletype 
writers  to  solve  a  problem,  asked  them  it  they  were  Interested  and,  it 
so,  when  during  the  upcoming  week  tliev  would  be  available  to  part  ict 
pate.  Once  I  had  this  informat ion  from  all  ot  the  available  subjects, 

I  randomly  assigned  them  to  teams  within  the  various  conditions,  .lob 
roles  in  t ho  mixed-sex  teams  wore,  ot  course,  assigned  on  the  basis  ot 
sex,  and  in  the  llko-sox  teams  were  as  signed  on  the  basis  ot  a  coin 
t  lip.  After  I  had  paired  the  subjects  into  teams,  1  examined  the  time 
schedules  they  had  given  me  to  find  a  mutually  agreeable  t  Into  lot  t  ho 
t  wx'  subjects.  1  then  telephoned  them  to  toll  them  when  and  where  t  c 
repo rt  . 

Subjects  who  were  to  bo  sources  were  told  t o  report  t o  Ames 
Hall  room  ,’Vi  ( room  A  In  Figure  11,  while  tuturc  seekers  were  told  to 
report  to  room  .’lb  (room  01.  l'he  subjects  wore  told  that  they  would 
find  the  door  to  their  room  open,  and  that  they  weto  t  c<  come  lit,  close 
the  door,  take  a  seat  ,  and  wait  tor  me  t o  conn*  in  and  speak  with  them. 

Notice  in  Figure  1  that  the  two  subject  rooms  are  on  ditterent 
hallways.  This  helped  prevent  tin*  subjects  irom  seeing  each  othei  In 
advance,  and  from  real  Izlng  that  they  were  to  part  icipate  in  t ho  s.uno 
oxper  imont . 


Soma  scientists  have  suggested  that  the  teletypewriter  wide  of  common  icat  ion 
Is  too  "impersonal  especially  when  competed  with  other  telocommunicat  ion  modes, 
such  as  telephone  or  closed-circuit  television.  Although  teletypewriters  may  seem 
to  be  more  impersonal  than  these  other  modes,  this  has  yet  to  be  scientifically 
demonstrated. 

Both  you  and  your  partner  are  now  working  on  this  questionnaire,  which  is 
my  way  of  trying  to  find  out  Just  how  impersonal  teletypewriters  are.  First, 
please  answer  these  questions  so  we  can  compare  your  description  of  yourselt 
with  your  partner's  description  of  you: 


Your  year  in  school: 


Your  academic  major: 


Your  sea: 


Your  hometown  : 


Mow,  try  to  answer  the  questions  below  based  solely  on  what  you  have  learned 
about  your  partner  during  the  course  of  v’ur  teletypewriter  conversation.  1’leuse 
indicate  how  sure  you  are  of  each  answer  by  checking  the  appropriate  box  in  the 
rating  scale;  if  your  answer  is  more  than  "Just  a  guess"  then  please  write  on  the 
back  of  this  page  what  you  based  your  decision  on . 


1)  Uhat  is  your  partner's  year  in  school: 
How  sure  are  you? 


Just  a 

guess 


□  □□□□□□ 


Absolutely 

certain 


2)  What  is  your  partner's  sax?  _ 

How  sure  are  you? 

Absolutely 
certain 


X-  □□□□□□□ 


3)  What  is  your  partner's  academic  major? 
How  sure  are  you? 

Just  a 
guess 


□  □□□□□□ 


Absolutely 
ce  rtain 


4)  In  what  geographic  region  of  the  U.S.  is  your  partner's  hometown’ 


How  sure  are  you? 


•*  □□□□□□□ 


t  ho  quost  1 onna ( t  o . 


Figure  4. 


The  second  page  of 


I’tcpaiat  Ion  tot  t  lie  subject-..  Hetore  the  subjects  wcte  to 
arrive,  1  put  on  tin*  worktables  In  i'll  oli  room  a  slaudaid  subject  con- 
soiit  form,  .1  sheet  of  blank  paper  that  the  subjects  could  use  tot 
notes,  .uni  three  -harp  pencils.  I  left  the  doors  to  the  hallways  wide 
open,  ami  the  iloots  to  the  ohseivatlon  t  ovxn  shut  .  I  positioned  mvsell 
between  tin-  two  one  wav  mlrrot--  In  the  ohseivatlon  toom  ami  walled  toi 
t  he  sub  leet  to  at i  ive. 

I  he  aitlval  ot  the  subjects.  Ivon  though  tin-  two  suh|eets  on 
a  team  were  both  assigned  the  same  arrival  time,  out  ot  all  48  team-., 

1  did  not  have  one  whete  the  subjects  arrived  within  two  minute-,  ot 
eaeh  ot  In- 1  . 


It  the  team  was  to  he  Introduced,  when  the  ttist  sub  Icy t 
arrived  1  poked  mv  head  thtouv.h  tin  ohse  tv.it  t  i>n  loom  dooi  and  . .  1 1 »! , 
"We'll  he  readv  to  he>-,ln  as  oon  is  vom  partnei  arrives."  When  the 
second  subject  aiitved,  1  allowed  him  t  v  biietly  -.t  .  nl  in  the  doorway 
t  o  tin-  tlrst  .uh|eet'-.  room  as  I  tut  t«*dueed  them  with-  "This  i -.  vom 
partnet  toi  the  experiment."  1  then  immediately  esiviteJ  tin-  second 
subject  tv'  his  -eat  In  the  ot  bet  i  v*om .  N.'t  Ice  In  t'lguic  1  that  at 
this  point,  even  with  the  observat  Ion  i  vu'in  doois  i>p«*n,  tin-  sub  |  eels 
eon  I  vl  o  t  see  ,-ac  h  v>  t  In- 1  . 


1  then  stood  between  the  t  wo  open  ohsetvat  ion  mom  dvx'i--,  -.aid 
"I'll  start  placing  von  the  tape  with  yout  Inst  tint  ions  i>n  it,"  and 
shut  t  he  dooi  -. . 


Obvion-.lv,  then-  was  no  li  -duct  ton  with  the  anonymous  teams, 
although  mo  t  .-liiei  a-.pivt  ;;  ot  the  pu  I  were  tin-  same.  Vhe  vl  i  1 
I  v  rener  ts  ili.it  \.  ien  tin  second  subject  at  t  simply  poked  mv 

head  throngs  the  doo  i  to  In-  i  oom  and  intoniuv  til  that  hi?,  p.utnei 

had  alteadv  it  tlv.-d  o  1  was  going  tv'  begin  playing  the  ins  t  rue  t  I  on 

tape.  I  closed  the  dooi  attei  me,  i|U  i  e  k  1 V  Informed  the  tltst  sub  .  .( 

that  we  wen-  te.ivly  to  beg  in,  and  then  started  pluving  the  tape. 

Hie  ins  t  i  ml  li'ns  .  in  all  ptev  toils  Hi'pkins  t  e  lecy'imniiti  I  vat  ton 
experiments,  tin-  expet  inu-met  has  le.ivl  the  Inst  rui-t  Urns  tv>  the  sub 
icets  taee  to  lava-.  However,  keep  iny  In  miitvl  Rv'sent  tut  1 1  s  tll|'t>'  t  tml 
lugs  ah  -lit  the  et  Icets  ot  uiiuitent  Iv'iial  expet  'Intent  ei  sub  jt-et  lntetav 
t  Ion,  1  elected  tv'  prerecord  my  instructions  on  tape  tv'  Insure  that  all 
teams  heard  instructions  that  wen  identical,  both  verbally  and  non 
ve  t  ha  l  U- . 


There  wen-  two  sets  o!  instinct  ton--,  one  toi  the  unlimited 
teams  and  one  tot  the  sell  limited.  These  dltlctcd  only  in  thelt 
explanation  ot  the  bonus  payment  system. 

The  Inst  t  net  ions  began  bv  thanking  the  sub  (eels  tot  p.-t  it  ( c  t  pa 
t  lug  in  mv  experiment ,  the  purpose  o t  which,  1  said,  was  t o  see  what 
happens  when  people  solve  problems  by  common t eat  lug  ovet  a  teletype 
wiltct  system.  I  went  on  t  o  explain  the  u.it  me  ot  the  problem  t  hey 
had  tv'  solve  (i.e.,  the  assembly-  ot  the  mode ll ,  nut  what  tlu-lt  |oh  roles 
would  he. 


Next  ,  l  explained  the  payment  system.  1  did  not  at  tins  t  line 
fully  explain  my  rules  for  word  counting  to  the  self-limited  subjects. 
Rather,  I  told  them  that  I  would  shortly  be  giving  them  a  list  of 
rules  to  which  they  could  refer  during  the  course  of  the  session. 

1  then  provided  a  thorough  explanation  ot  how  to  use  the  tele 
typewriter  system. 

After  that,  I  explained  to  the  subjects  that  it  would  ruin  inv 
analyses  it,  during  the  session,  they  engaged  in  any  sort  ot  "chit¬ 
chat,"  or  especially  if  they  asked  tor  or  gave  anv  information  not 
directly  related  to  assembling  the  model.  1  told  them  that,  it  they 
wanted,  they  would  have  a  chance  to  get  to  know  each  other  or  plav 
with  the  teletypewriters  after  the  session  was  over. 

Finally,  l  concluded  the  instruction  tape  hy  telling  them  that 
1  would  be  in  to  talk  with  them  shortly  to  answer  their  questions  and 
provide  them  with  the  material  they  would  need  for  the  experiment. 

After  the  tape  was  over,  1  poked  my  head  into  the  seeker's 
room  and  said,  "I'll  be  with  you  in  a  few  minutes.  1  want  to  talk  to 
your  partner  first."  In  addition,  it  the  team  was  to  be  sol f- 1  unit ed, 

I  gave  the  seeker  a  copy  of  word  counting  rules  and  suggested  he  exam¬ 
ine  them  until  1  returned.  Tills  list  of  word  counting  rules  was  iden¬ 
tical  to  the  list  of  rules  presented  in  the  Dependent  Measures  sect  ion 
of  this  paper. 

1  then  entered  the  source’s  room,  carrying  witli  me  a  box  con 
tainlng  the  assembled  model  and  the  envelope  with  the  quest  ionnai i e. 

If  the  team  was  self-limited  1  also  brought  a  copy  ot  the  word  count  tng 
rules  and  had  the  source  examine  it  before  1  did  anv thing  else.  1 
allowed  the  source  to  ask  questions,  which  1  answered  hv  paraphrasing 
what  had  already  been  said  on  the  taped  inst ruct  ions .  1  put  the  box 

and  the  envelope  on  the  worktable  and  said,  "1  want  to  go  talk  to  vour 
partner  now.  Please  don't  examine  the  model  until  you  hear  the  system 
come  on,  and  don't  examine  the  questionnaire  until  the  system  goes  off." 
I  then  asked  the  subject  to  sign  the  consent  and  payment  forms  and  give 
t  hem  t  o  me . 

1  followed  essentially  the  same  procedure  with  the  seeker, 
except,  of  course,  1  gave  the  seeker  a  box  containing  separate  parts 
rather  than  an  assembled  model.  Also,  before  leaving,  1  made  certain 
that  the  seeker  knew  when  and  how  to  shut  the  communie.it  ion  system  ot  t  . 

The  experimental  session.  The  actual  experimental  session  began 
as  soon  as  1  turned  on  the  system  power.  The  motors  in  the  input -out¬ 
put  writers  make  a  clearly  audible  noise  when  thov  lirst  receive  power, 
and  this  was  the  signal  for  the  subjects  to  begin  work.  From  my  van¬ 
tage  point  in  the  observation  room,  1  could  watch  the  subjects  while 
they  worked,  and  could  even  read  what  they  were  typing. 
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l'he  timed  portion  ot  the  session  ended  when  the  seeker  shut 
off  the  s'-  tem,  at  which  point  the  subjects  began  work  on  their  ques¬ 
tionnaires.  When  they  were  done,  I  permitted  them  to  meet  each  other 
and  discussed  various  aspects  of  the  experiment  with  them.  I  thanked 
them  when  they  left ,  and  asked  them  to  encourage  others  to  sign  up 
for  the  expei iment  .  The  subjects  subsequently  received  their  payment 
checks  through  the  campus  mail,  usually  within  a  few  days. 

Results  and  Discussion 

All  48  teams  reached  a  solution.  Only  one  team,  an  all-lerault 
self-limited  anonymous  team,  tailed  to  reach  t  lie  correct  solution.  In 
other  words,  the  seeker  on  that  team  failed  to  build  an  exact  replica 
of  the  source's  model.  Because  47  of  the  48  teams  reached  the  correct 
solution,  accuracy  ot  solution  is  not  a  useful  dependent  measure  for 
this  study,  and  will  In-  ignored  in  the  remaining  analyses.  However, 
before  any  further  analyses  are  done,  it  is  necessary  to  examine  the 
quest ionna i re  data  from  the  anonymous  subjects  to  determine  it  they 
genuinely  were  anonymous . 

As  was  explained  in  the  Quest  lonnaire  data  section,  t  lie  anony¬ 
mous  team  members  were  asked  about  the  sex  of  their  partner,  how  sure 
they  were,  and  what  they  based  their  decision  on. 

There  were  48  anonymous  team  members,  31  of  whom  correctly- 
guessed  the  sex  of  their  partner.  Assuming,  that  this  is  a  simple 
binomial  choice,  with  the  likelihood  of  one  answer  equaling  the  likeli¬ 
hood  of  the  other,  the  probability  of  getting  by  chance  31  or  more 
correct  answers  equals  0.03.  This  is  less  than  the  traditional  cut-oti 
level  for  statistical  significance  of  0.05,  but  not  by  much. 

A  closer  examination  reveals  that  the  correct  responses  were 
fairly  evenly  divided  between  job  roles,  16  out  of  34  correct  for  seek¬ 
ers,  15  out  of  34  correct  for  sources.  However,  broken  down  by  sex, 
there  were  13  right  out  of  24  for  males,  as  compared  with  18  right  out 
of  24  for  the  females.  The  probability  of  getting  bv  chance  18  or  more 
correct  out  of  24  equals  0.01,  which  is  statistically  significant. 

In  addition  to  deciding  about  their  partner's  sex,  the  subjects 
had  to  indicate  how  sure  they  were  of  their  answer  on  a  scale  of  one  to 
seven,  where  one  equals  "just  a  guess"  and  seven  equals  "absolutely 
certain."  I  took  the  numerical  rating  from  this  scale  and  recorded  it 
as  a  positive  number  if  they  had  correctly  guessed  their  partner's  sex, 
and  as  a  negative  number  if  they  had  not.  For  example,  it  a  subject 
incorrect Iv  guessed  his  partner's  sex,  and  checked  the  third  box  on  the 
scale,  I  recorded  a  -I  for  him.  In  this  way  I  constructed  a  scale  that 
runs  from  -7  to  +7,  with  zero  as  the  mld-polnt.  On  this  scale  tin- 
average  score  for  the  subjects  was  0.23,  with  a  standard  deviation  ot 
4.12.  A  t-test  revealed  that  this  mean  score  was  not  significantly 


different  t rom  zero.  Broken  down  by  sex,  the  females  averaged  0.9t>  on 
tins  scale,  while  the  males  averaged  -0.*).  Neither  of  these  were  sig¬ 
nificantly  different  from  zero,  nor  were  they  significantly  different 
I rom  each  other. 

Out  of  the  48  subjects,  3b  rated  their  assuredness  as  2  or 
higher  on  the  7-point  scale.  In  other  words,  3b  subjects  felt  that 
their  judgment  ot  their  partner's  sex  was  more  than  "just  a  guess." 

Of  those  3b,  13  were  wrong.  Once  again  making  use  of  the  binomial 
tests,  the  probability  of  getting  by  chance  13  or  fewer  wrong  answers 
out  of  3b  is  greater  than  0.2,  clearly  not  significant. 

The  12  remaining  subjects  replied  with  what  they  considered  to 
be  just  guesses.  However,  10  of  these  12  made  the  right  guess.  The 
probability  of  having,  by  chance,  two  or  fewer  wrong  answers  out  of  12 
is  less  than  0.02.  So,  the  subjects  who  were  more  sure  of  their 
answers  did  far  worse  than  those  who  said  that  they  had  simply  guessed, 
which  is  why  some  of  the  simple  binomial  tests  reached  significance, 
although  the  t-tests  on  the  corresponding  rating  scale  scores  did  not. 

The  subjects  who  answered  with  "more  than  just  a  guess"  were 
asked  to  explain  why.  The  seeker  on  one  all-male  team  replied  that  his 
partner  "sounded  educated,"  and  was  therefore  female.  A  male  on 
another  team  wrote,  "My  partner  used  the  word  'botched'  and  women  don't 
use  that  word."  His  partner  proved  to  be  the  exception  to  that  rule. 
Still  others  rated  their  partners  solely  on  the  basis  ot  typing 
ability,  with  the  popularly  held  stereotype  apparently  being  that 
females  are  better  typists  than  are  males. 

It  is  mv  belief  that,  on  the  average,  the  subjects  who  "just 
guessed"  were  basing  their  responses  on  indicators  ot  sex  that  were  tar 
more  subtle,  yet  apparently  more  accurate  than  those  used  by  the  other 
subjects.  The  findings  also  suggest  that  females  may  he  better  than 
males  at  detecting  and  properly  interpreting  these  indicators.  How¬ 
ever,  the  evidence  for  these  assertions  is  far  from  conclusive. 

Further,  and  of  more  Importance  to  this  study,  the  48  anonymous 
subjects  averaged  only  0.23  on  the  -7  to  +7  judgment  scale,  confirming 
that,  unlike  their  introduced  counterparts,  the  anonymous  subjects 
interacted  without  at  any  time  knowing  the  sex  ot  their  partners. 

The  Univariate  Analyses  ot  Variance 

After  confirming  the  effectiveness  ot  the  anonymous  condition, 
t ho  next  step  was  to  perform  univariate  analyses  ot  variance  on  each  ot 
the  fifteen  dependent  measures  used  in  this  study.  The  result;  ot  the 
analyses  done  on  the  time  and  verbal  measures  are  shown  in  Table  3,  the 
different  types  of  requests  for  Information  in  Table  4,  and  the  ques¬ 
tionnaire  data  in  Table  5. 
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Table  3 

The  £  Values  from  the  Analyses  of  Variance  Performed  on  the 

Time  and  Verbal  Productivity  Measures 

The  values  shown  are  for  the  upper  end  of  the  confidence  interval: 

0.05  >  £  >  0.025;  0.025  >  £  >  0.01;  0.01  >  £  >  0.005;  0.005  >  £  >  0.001 


_ _ Dependent  Measures  _ 

Source  of  Time  to  Words  Messages  Message  Communica- 

Variation _ _ _ Solut  ion _ _  Length t  ion  Rate 

Sex  of  Seeker  (SK) 

Sox  of  Source  (SO)  .05  .025 

Anonymity  (A) 

Word  Usage  (W)  .005  .001  .001  .001  .001 

SK  x  SO 

SK  x  A 

SK  x  W 

SO  x  A 

SO  x  W 

A  x  W 

SK  x  SO  x  A 
SK  x  SO  x  W 
SK  x  A  x  W 
SO  x  A  x  W 
SK  x  SO  x  A  x  W 

Job  Roles  (JR)  -  .001  .001  .001 

JR  x  SK  - 

JR  x  SO  - 

JR  x  A  - 

JR  x  W  -  .001  .001  .005 

JR  x  SK  x  SO  -  .05 

JR  x  SK  x  A  - 

JR  x  SK  x  W  - 

JR  x  SO  x  A  - 

JR  x  SO  x  W  - 

JR  x  A  x  W  - 

JR  x  SK  x  SO  x  A  - 

JR  x  SK  x  SO  x  W  - 

JR  x  SK  x  A  x  W  - 

JR  x  SO  x  A  x  W  - 

JR  x  SK  x  SO  x  A  x  W  - 


r 


Table  4 

The  £  Values  from  the  Analyses  of  Variance  Performed  on  the 
Different  Types  ol  Information  Requests 


- - 

Source  of 
Var iat ion 

Dependent  Measures 

New  In  for¬ 
mat  Ion 

Clarif i- 

cat ion 

Own  Under¬ 
standing 

Partner ' s 
Under- 
st  an  d  in  g 

Other 

Sex 

of  Seeker  (SK) 

.05 

Sex 

of  Source  (SO) 

.05 

Anonymity 

(A) 

Word 

Usage 

*  (W) 

.001 

.02  5 

.001 

SK  x 

SO 

SK  x 

A 

SK  x 

W 

.025 

SO  x 

A 

SO  x 

W 

A  x  I 

w 

SK  x 

SO  x 

A 

SK  x 

SO  x 

W 

SK  x 

A  x  W 

.025 

SO  x 

A  x  W 

SK  x 

SO  x 

A  x  W 

Job 

Ro  les 

(JR) 

.001 

.001 

.001 

.001 

JR  x 

SK 

JR  x 

SO 

JR  x 

A 

.05 

JR  x 

W 

.001 

.025 

.001 

JR  x 

SK  x 

SO 

JR  x 

SK  x 

A 

JR  x 

SK  x 

W 

JR  x 

SO  X 

A 

JR  x 

SO  X 

U 

JR  x 

A  x  W 

JR  x 

SK  x 

SO  x  A 

JR  x 

SK  x 

SO  x  W 

JR  x 

SK  x 

A  x  W 

.05 

JR  x 

SO  x 

A  x  W 

JR  x 

SK  x 

SO  x  A  x  w 

Table  5 


The  £  Values  from  the  Analyses  of  Variance  Performed  on  the 

Questionnaire  Data 


' 

Dependent  Measures 

— 

Source  of 

Var iat ion 

Seekers  ' 
_ Clarity 

'  Seekers'  Sources'  Sources' 

B  rev  ity _ Clarity  Brevity 

Team  Per 
f ormance 

Sex  of  Seeker  (SK)  .025 

Sex  of  Source  (SO) 

Anonymity  (A) 

Word  Usage  (W) 

SK  x  SO 
SK  x  A 
SK  x  W 
SO  x  A 
SO  x  W 
A  x  W 

SK  x  SO  x  A 
SK  x  SO  x  « 

SK  x  A  x  W 
SO  x  A  x  W 
SK  x  SO  x  A  x  W 

Job  Roles  (JR) 

JR  x  SK 
JR  x  SO 
JR  x  A 
JR  x  W 

JR  x  SK  x  SO 
JR  x  JK  x  A 
JR  x  SK  x  W 
JR  x  SO  x  A 
JR  x  SO  x  w 
JR  x  Ax  W 
JR  x  SK  x  SO  x  A 
JR  x  SK  x  SO  x  w 
JR  x  SK  x  A  x  w 
JR  x  SO  x  a  x  w 

JR  x  SK  x  SO  x  a  x  W 


.001  .01 


.05 


.05 


.05 


As  can  be  seen  from  these  three  tables,  449  separate  F-ratios 
were  computed.  By  definition,  about  22  of  the  449  F's  would  be  expected 
to  reach  the  0.05  level  of  significance  by  chance.  In  actual  fact,  36 
reached  significance,  and  it  is  not  immediately  clear  which  of  these 
36  indicate  the  presence  of  a  genuine  effect  and  which  are  spurious 
signif icances. 

Nine  of  the  significances  are  for  the  main  effect  of  job  roles. 
Eight  of  these  are  significant  beyond  the  0.001  level.  However,  this 
is  neither  surprising  nor  terribly  interesting.  All  of  the  previous 
Hopkins  studies  that  have  had  specific  job  roles  have  reported  highly 
significant  differences  attributable  to  this  factor.  Although  the 
exact  nature  of  the  differences  in  this  experiment  will  soon  be  dis¬ 
cussed  in  detail,  in  general  they  mean  simply  that  people  who  build 
models  do  not  behave  the  same  way  as  those  who  give  instructions. 

There  were  eleven  significant  interactions  of  job  roles  with 
either  between-teams  main  effects  or  between-teams  effect  interactions. 
In  seven  of  these  cases,  the  corresponding  between-teams  main  effect  or 
interaction  was  also  significant  by  itself.  As  a  general  simplifica¬ 
tion  for  these  seven  instances,  not  only  did  the  particular  between- 
teams  factor  affect  overall  team  performance,  but  it  also  affected  the 
source  and  seeker  differently,  hence  the  significant  interaction  with 
the  job  roles  factor. 

The  four  remaining  significant  interactions  with  job  roles 
barely  reached  significance  (0.05  >  £  >  0.025),  and  may  therefore  be 
chance  effects.  This  is  not  saying  the  seven  other  significant  job 
roles  interactions  are  not  chance  effects.  They  could  easily  be.  In 
order  to  demonstrate  that  they  are  probably  not  chance  effects,  it  is 
necessary  to  demonstrate  that  the  between-teams  component  of  the  inter¬ 
action  was  itself  not  a  chance  effect. 

All  previous  Hopkins  telecommunications  studies  have  used  a 
relatively  simple  method  for  determining  chance  effects.  The  method 
had  been  to  ask,  "For  this  source  of  variation,  what  is  the  overall 
probability  of  obtaining  by  chance  the  number  and  magnitude  of  signifi¬ 
cant  probabilities  that  were  observed?"  If  the  probability  level  thus 
computed  was  greater  than  a  previously  chosen  Alpha  level  (usually 
0.05)  then  all  significant  univariate  probabilities  for  that  source  of 
variation  were  assumed  to  be  chance  effects. 

This  method  has  one  important  underlying  assumption:  The 
dependent  measures  must  be  uncorrelated.  Unfortunately,  this  is  not  by 
any  means  a  valid  assumption,  as  will  be  demonstrated  in  the  Intercor¬ 
relations  of  Dependent  Measures  section.  Because  the  measures  are 
intercorrelated,  the  probabilities  of  their  reaching  significance  are 
linked,  thereby  invalidating  the  above  described  method  of  determining 
chance  effects.  A  set  of  several  dependent  measures  may  show  signifi¬ 
cances  solely  because  all  these  measures  are  simply  different  ways  of 


describing  the  same  single  underlying  (actor.  It  follows  tli.it  this 
single  factor  might  cause  .1  chance  effect ,  and  this  would  show  up  as 
signlt icances  lot  all  the  different  dependent  measures  that  describe 
It  . 

The  Multivariate  Analysis  ol  Variance 

Clearly,  what  is  needed  Is  an  analysis  of  variance  done  on 
these  underlying  factors  rather  tlian  on  the  actual  dependent  measures. 
Great  ly  slmplltied,  this  is  what  is  done  by  the  multivariate  analysis 
of  variance,  or  MANOVA,  which  computes  a  single  l-’-rat  lo  tor  each 
source  ot  variation.  This  K  is  computed  using  a  linear  combination  of 
all  applicable  dependent  measures,  with  the  factor  weights  in  the  1  in 
ear  combination  being  chosen  so  as  to  maximize  the  resulting  F-ratlo. 

In  getteral,  it  the  multivariate  F-rat  io  tor  a  certain  source 
of  varlat  ion  tails  to  reach  significance,  then  all  significant  unlv.n  i 
ate  F's  tot  that  source  of  variat  ion  are  assumed  to  be  chance  eftects. 
On  the  other  hand.  It  a  multivariate  !•’  tor  a  source  ol  variation  does 
reach  siguit  icunee,  then  all  corresponding  signif  ieant  univariate  K' 
are  regarded  as  being  genuinely  sign  i  I  i  cattl . 

This  is  the  method  1  used  for  determining  which  of  the  between- 
teams  effects  In  Tallies  ',  4,  and  4  were  genuine  and  which  were  chance. 
The  results  of  the  MANOVA  arc  summarized  in  Table  (>,  which  shows  the 
multivariate  F  and  corresponding  p  level  lor  all  bet  wean- teams  sources 
ot  variat ion.  As  can  be  seen  in  Table  b,  only  three  sources  ol  vai ia- 
t  ion  were  significant  beyond  the  O.O'i  level:  sex  of  source,  word 
usage,  and  the  sex  of  seeker  by  sox  of  source  bv  anonymity  interact  ion. 

Sex  of  Source 

The  sox  of  source  was  significant  in  throe  univariate  analyses 
ol  variance:  time  to  solution,  numbei  ol  words  used,  and  nurabei  ol 
requests  for  information  on  partner's  understanding  or  status. 

As  will  hi1  seen  in  Table  8,  in  the  ini  orcorre  lat  ions  of  Depen¬ 
dent  Measures  sect  ion,  the  mean  within-ccll  Intercorrelation  ol  t  into  to 
solut  ion  w  i  t h  words  was  moderately  high  (r  ■  0.460,  p  •  0.01).  On  the 
other  hand,  the  number  of  requests  for  information  about  the  parluet 
was  only  marginal iv  correlated  with  the  numbei  ol  words  (t  0.141, 
p  0.06),  and  was  uncorrcluted  witli  time  to  solut  ion  (t  0.009). 

Tills  suggests  t  li.it  at  least  two  independent  underlying  I  actors 
accounted  tor  the  significant  multivariate  K  for  sex  ot  source. 

Time  to  solution.  The  average  time  to  solution  for  teams  with 
a  male  source  was  .’6.7  minutes,  while  it  was  only  .’1.5  minutes  tor 
teams  witli  a  iemale  source. 

Number  of  words.  Teams  witli  a  male  source  used  an  average  ot 
381  words  to  solve  t he i t  problem,  while  teams  witli  a  iemale  source  used 
an  average  of  only  791  words  t o  solve  t he  identical  problem. 


Summitry  of  t  ho  MANOYA  I’ot  t  omtod  on  t  he  Hot  ween- t  earns 


Sourci's  ot  Var  iat  ion 


Sources  ot  Var iat ion 

F-rat lo 

£  less  t 

Sox  of  Seeker  (SK) 

0.  771 

0.691 

Sex  ot  Source  (.SOl 

2.9  35 

0.016 

Anonymity  (A) 

1  . 1  80 

0.  164 

Word  Us.tRe  fWl 

lh.SU 

0 . 00 1 

SK  x  SO 

1.035 

0 . 4  66 

SK  x  A 

1.4  72 

0.216 

SK  x  W 

1 .  304 

0.240 

Si1  x  A 

1.684 

0 .  1 4  5 

SO  x  w 

0.  5  15 

0.887 

A  x  W 

0.  500 

0.41 0 

SK  x  SO  x  A 

2.275 

0.049 

SK  x  SO  x  W 

0.760 

0.701 

SK  x  A  x  W 

0.8h2 

0 . 6  1 0 

SO  x  A  X  W 

1.251 

0.  122 

SK  x  SO  x  A  x  W 

1 .284 

0.  101 

NOTH:  All  ot  t  ho  above  F- ratios  havo  IS  decrees  ot  freedom  In  tho 

numerator  and  IS  in  tho  denominator. 
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Partner's  understanding.  On  teams  with  a  male  source,  the  team 
members  questioned  each  other's  understanding  an  average  of  4.4  times 
per  team,  while  the  teams  with  a  female  source  averaged  only  2.6  ques¬ 
tions  of  this  type  per  session. 

Discussion ■  Although  I  had  no  firm  expectations  about  what  my 
study  would  find  with  regard  to  sex  differences,  1  certainly  did  not 
expect  the  results  to  turn  out  as  they  did.  The  sex  of  the  seeker 
made  no  apparent  difference,  and  the  sex  of  seeker  by  sex  of  source 
interaction  failed  even  to  approach  significance.  However,  the  main 
effect  of  the  sex  of  the  source  did  make  a  difference,  but  not  the  one 
that  might  have  been  expected:  On  a  problem-solving  task  requiring 
spatial  and  mechanical  skills,  it  was  the  females  who  were  superior, 
although  males  supposedly  are  better  at  problem  solving  and  have  higher 
spatial  and  mechanical  ability. 

On  the  other  hand,  this  was  a  telecommunications  experiment, 
and  females,  on  the  average,  do  tend  to  have  slightly  higher  verbal 
ability.  However,  this  is  unlikely  to  be  the  sole  reason  for  the  dif¬ 
ferences  between  the  male  and  female  sources  in  ...y  study.  Recall  that 
in  a  previous  experiment,  verbal  ability,  as  measured  by  a  standardized 
test,  could  not  be  shown  to  affect  performance  on  a  problem-solving 
task  similar  to  the  one  I  used  (Hoecker,  1979). 

I  can  only  offer,  based  on  my  observations,  an  educated  guess 
as  to  why  the  female  sources  were  superior  in  this  experiment. 

Although  I  did  not  maintain  records  of  this  during  the  course  of  the 
experiment,  1  did  notice  that  the  seekers  on  male-source  teams 
improperly  assembled  and  then  had  to  disassemble  the  model  more  often 
than  did  the  seekers  on  female-source  teams.  That  this  took  place  is 
sometimes  apparent  from  reading  the  transcripts  of  their  conversations. 
On  the  average,  the  female  sources  tended  to  give  instructions  that 
were  a  bit  longer,  but  apparently  more  thorough  and  less  ambiguous 
than  those  given  by  the  males. 

The  seekers  on  the  male-source  teams  had  a  greater  tendency  to 
become  confused  by  the  instructions  they  had  received.  Often  they 
would  try  to  assemble  the  model  according  to  their  instructions, 
finally  realize  that  it  was  not  working,  and  then  ask  for  help.  (The 
seekers  on  male-source  teams  averaged  2.6  requests  for  clarification 
per  session,  as  opposed  to  1.9  for  seekers  on  female-source  teams; 

0.1  >  £  ">  0.03.)  The  male  sources  on  these  teams  would  then  go  back 
over  the  different  stages  of  the  model's  assembly,  asking  each  time 
something  like,  "Did  you  understand  that  part?"  Each  of  these,  of 
course,  counted  as  a  request  for  information  about  the  partner's  under¬ 
standing  or  status.  This  accounts  for  the  significant  difference 
between  male-  and  female-source  teams  for  this  type  of  information 
request.  The  fact  that  the  male  sources  had  to  explain  some  things 
over  again  accounts  for  their  teams'  longer  times  to  solution  and 
greater  verbal  output. 


The  underlying  reasons  for  these  result  .  are  not  at  all  obvi¬ 
ous.  As  was  discussed  earlier,  males  tend  to  approach  problem-solving 
tasks  with  greater  confidence  than  do  females;  perhaps  in  this  case 
they  were  simply  over-confident  to  the  point  of  being  careless.  How¬ 
ever,  something  more  subtle  than  that  may  have  been  happening  here. 
Given  that,  on  the  average,  females  tend  to  be  poorer  than  males  at 
mechanical  and  spatial  tasks  (and,  perhaps  more  importantly,  also 
believe  themselves  to  hi'  poorer),  it  is  possible  that  the  females  in 
this  study  perceived  the  model  to  be  more  complicated  than  did  the 
males,  and  therefore  made  more  of  an  effort  to  describe  clearly  and  in 
greater  detail.  This,  perhaps  in  conjunction  with  the  previously  dis¬ 
cussed  differences  in  general  confidence  and  verbal  skills,  may  be  the 
reason  for  the  female  sources'  superiority  in  this  study.  However, 
this  assessment  represents  pure  speculation  on  my  part,  as  1  have  no 
hard  data  to  back  me  up.  It  will  require  further  experimentation  to 
fully  explain  the  sex  differences  found  in  this  study. 

With  regard  to  the  absence  of  a  significant  sex  of  seeker  by 
sex  of  source  interaction,  1  believe  that  this  can  be  attributed  to  the 
absence  of  nonverbal  communication  in  teletypewriter  dialogue. 

Word  Usage 


The  main  effect  of  word  usage  was  significant  beyond  the  0.05 
level  in  eight  of  the  univariate  analyses  of  variance.  These  eight 
consisted  of  the  analyses  on  all  t ive  of  the  time  and  verbal  measures, 
plus  analyses  dealing  with  three  of  the  five  types  of  requests  for 
information;  requests  for  new  information,  to  confirm  own  understand¬ 
ing  or  status,  and  to  confirm  partner's  understanding  or  status. 

Time  to  solution.  The  unlimited  teams  took  an  average  of 
27.6  minutes  to  solve  their  problem,  while  the  self-limited  teams  took 
an  average  of  20.5  minutes  to  solve  the  identical  problem. 

Number  of  words .  On  the  average,  the  unlimited  teams  used 
545.5  words,  as  compared  with  the  self-limited  teams,  who  used  128.0 
words.  Also,  averaged  across  all  teams,  seekers  used  an  average  of 
103.8  words,  while  sources  used  233.3,  thereby  accounting  for  the  sig¬ 
nificant  job  roles  main  effects. 

There  was  also  a  highly  significant  job  roles  by  word  usage 
interaction.  Unlimited  seekers  used  an  average  ol  177.8  words  while 
unlimited  sources  used  an  average  of  367.7  words.  On  the  other  hand, 
self-limited  seekers  used  an  average  of  29.8  words  and  sell -limited 
sources  98.8  words.  Notice  that  on  the  average  self-limited  seekers 
used  148  fewer  words  than  did  unlimited  seekers,  while  self-limited 
sources  used  268.9  fewer  words  than  did  their  unrestricted  counter¬ 
parts.  It  is  the  difference  between  the  last  two  figures  that  accounts 
ior  the  significant  job  roles  by  word  usage  interaction. 
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Tlie  above  pattern  of  significant  main  effects  for  word  usage 
and  job  roles,  plus  a  significant  job  roles  by  word  usage  interaction, 
is  repeated  by  many  of  the  dependent  measures  about  to  be  discussed. 

Messages .  Unlimited  teams  used  an  average  of  34.6  messages, 
while  self-limited  teams  used  an  average  of  16.0  messages.  For  this 
dependent  measure,  the  main  effect  of  job  roles  was  not  significant, 
but  there  was  a  significant  job  roles  by  word  usage  interaction.  Aver¬ 
aged  across  all  unlimited  teams,  the  seekers  generated  17.5  messages, 
and  the  sources  17.1.  Comparable  figures  for  the  self-limited  teams 
are  7.8  and  8.2  messages,  respectively.  Thus,  unlimited  seekers  gen¬ 
erated  an  average  of  0.4  messages  more  than  did  the  sources,  while 
self-limited  seekers  generated  0.4  messages  fewer  than  did  their 
sources.  Wl\at  this  means  is  that  on  unlimited  teams  the  seekers  gen¬ 
erally  typed  the  first  and  the  last  message  (e.g.,  "Hello"  and  "Good¬ 
bye"),  while  on  self-limited  teams  it  was  the  source  who  frequently 
had  the  first  and  last  messages.  In  the  latter  case,  the  seekers,  not 
wanting  to  waste  words,  usually  quietly  waited  for  the  source  to  begin 
the  instructions,  and  at  the  conclusion  of  the  problem  turned  the  sys¬ 
tem  off  without  a  parting  message. 

Message  length.  For  the  measure  of  message  length  there  was  a 
significant  main  effect  for  word  usage,  a  significant  main  effect  for 
job  roles,  and  a  significant  job  roles  by  word  usage  interaction. 
Unlimited  subjects  used  an  average  of  18.0  words  per  message  (11.8  for 
seekers,  24.1  for  sources),  while  the  self-limited  subjects  used  an 
average  of  only  8.9  words  per  message  (3.6  for  the  seekers,  14.2  for 
the  sources).  Thus,  there  is  essentially  the  same  pattern  for  message 
length  as  there  had  been  for  words:  On  the  average,  the  unlimited 
subjects  used  more  words  per  message  than  did  the  self-limited,  and 
sources  used  more  than  seekers,  with  the  difference  between  the  two 
job  roles  being  greater  in  the  unlimited  condition  than  in  the  self- 
1 imited . 


Notice  that  none  of  the  figures  above  can  be  obtained  by 
dividing  the  previously  given  values  for  words  by  the  reported  number 
of  messages.  As  an  illustrative  example  of  why  this  is  the  case,  let 
us  assume  that  we  have  only  two  subjects,  A  and  B.  bet  us  further 
assume  that  A  used  only  one  message  and  one  word,  and  that  B  used  two 
messages  and  four  words.  As  a  result,  A's  average  message  length  is 
one  word,  and  B's  is  two  words.  Therefore,  averaged  across  the  two 
subjects,  they  had  a  mean  of  1.5  words  per  message.  However,  if  you 
were  to  look  solely  at  their  mean  number  of  words  (2.5)  and  their  mean 
number  of  messages  (1.5),  you  would  have  arrived  at  a  different,  incor¬ 
rect  value  for  their  mean  number  of  words  per  message  (1.67  rather  than 
1.50). 


Communicat  ion  ratts.  Once  again,  as  was  the  case  with  words 
and  message  length,  there  was  for  communication  rate  a  significant  main 
effect  for  word  usage,  a  significant  main  effect  for  job  roles,  and  a 


significant  job  roles  by  word  usage  interaction.  Unlimited  teams 
transmitted  an  average  of  20.1  words  per  minute  oi  problem  solution 
time  (6.5  words  per  minute  for  the  seeker,  13.6  for  the  source),  while 
self-limited  teams  transmitted  an  average  ol  6.5  words  per  minute  (1.4 
for  the  seeker,  5.1  for  the  source). 

Requests  for  informat  ion.  The  significant  main  effects  and 
interactions  associated  with  the  different  types  of  information 
requests  can  best  be  explained  with  the  help  of  Table  7.  This  table 
shows  the  mean  number  of  each  type  of  request  generated  by  seekers  and 
sources  In  the  unlimited  and  self-limited  conditions.  The  correspond¬ 
ing  percentages  for  each  mean  were  computed  by  taking  that  mean  value, 
multiplying  by  100,  and  then  dividing  by  the  sum  of  all  of  the  means  in 
that  column.  Notice  that  these  percentages  do  not  represent  the  mean 
that  would  have  been  obtained  by  averaging  the  percentages  of  each 
individual  subject;  it  was  not  feasible  to  obtain  a  percentage  break¬ 
down  for  each  subject  because  some  subjects  did  not  generate  any 
information  requests. 

AIL  together,  the  96  subjects  in  this  study  generated  a  total 
of  823  requests  for  information.  Of  these,  only  seven  were  classified 
as  belonging  to  the  category  of  "other  types."  The  remaining  four 
categories  of  information  requests  therefore  account  for  over  99%  of 
all  requests. 

For  each  of  these  four  categories  the  main  effect  of  job  roles 
is  highly  significant  (j>  <  0.001  in  all  cases).  The  nature  ol  these 
differences  can  easily  be  seen  in  Table  7.  Further,  the  differences 
attributable  to  the  word  usage  factor  can  also  be  seen  in  Table  7.  For 
all  categories,  the  self-limited  teams  had  fewer  information  requests 
than  did  the  unlimited.  However,  the  decrease  in  the  number  of 
requests  for  clarification  failed  even  to  approach  significance 
(p  =  0.34),  although  the  decreases  for  the  other  three  were  signifi¬ 
cant.  For  these  latter  three  there  were  also  significant  job  roles  by 
word  usage  interactions,  the  natures  of  which  can  be  seen  in  Table  7. 

Discussion .  Some  of  the  findings  reported  in  this  section  are 
not  new.  Ford  et  al.  (1979)  used  a  word  usage  factor  very  similar  to 
the  one  I  used.  I,  like  they,  found  that  self-limited  subjects,  when 
compared  with  unlimited,  use  fewer  words  and  fewer  messages,  average 
fewer  words  per  message,  and  Ivive  a  slower  communicat  ion  rate.  This  is 
exactly  what  l  had  expected  to  find.  What  I  had  not  expected,  and  what 
I  consider  to  be  one  of  the  most  important  findings  ol  my  study,  is 
that  tl\e  unlimited  subjects  took  over  33%  more  time  than  the  self-lim¬ 
ited  to  solve  the  identical  problem.  (Ford  et  al.,  did  not  find  a  sig¬ 
nificant  difference  for  time  to  solution.  This  could  be  because  their 
experiment  used  fewer  subjects  and  had  less  statistical  power  than 
mine;  also,  only  half  rather  than  all  of  their  subjects  were  in  a  tele- 
t ypewr iter  mo de . ) 
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A  clue  as  to  why  my  self-limited  teams  were  faster  may  lie  in 
one  of  the  conclusions  from  the  study  by  Weeks  et  al.  (1974):  subjects 
communicating  in  a  teletypewriter  mode  tend  to  transmit  messages  as 
they  compose  them,  rather  than  compose  and  edit  the  messages  prior  to 
transmission.  However,  Weeks  et  al.  were  talking  only  about  unlimited 
subjects,  and  their  conclusions  do  not  necessarily  apply  to  subjects 
who  must  limit  their  word  usage.  This  is  because,  unlike  unlimited 
subjects,  self-limited  subjects  must  pay  for  every  word  they  use,  and 
therefr re  cannot  afford  the  luxury  of  "thinking  out  loud"  (i.e.  over 
the  teletypewriter).  Instead,  they  tended  to  compose  and  edit  their 
messages  in  advance.  The  result  is  that  their  messages  were  generally 
shorter,  while  apparently  conveying  as  much  information  as  the  longer, 
rambling  messages  sent  by  unlimited  subjects.  Therefore,  I  believe 
that  the  self-limited  teams  were  faster  than  the  unlimited  simply 
because  it  apparently  takes  longer  to  compose  a  message  on  a  teletype¬ 
writer  than  it  does  to  compose  on  notepaper  or  in  one's  head  a  concise 
message  and  then  transmit  that. 

The  above  statement  might  not  make  intuitive  sense  to  people 
who  find  it  faster  to  use  a  typewriter  to  compose  and  edit  manuscripts. 
Keep  in  mind,  though,  that  a  teletypewriter  is  not  nearly  as  convenient 
to  use  as  a  standard  typewriter,  in  that  the  user  is  not  permitted  to 
erase  anything  once  it  is  written,  and  cannot  even  return  to  a  previous 
line  to  strike  something  over.  This  makes  composing  messages  on  a 
teletypewriter  a  very  tedious  task. 

Let  us  now  move  on  to  examine  the  requests  for  information. 
First,  it  is  important  to  note  that  there  was  not  a  significant  differ¬ 
ence  in  the  number  of  requests  for  clarification,  suggesting  that  the 
subjects  found  unlimited  and  self-limited  dialogue  to  be  equally  clear. 
(In  this  context,  it  should  be  pointed  out  that  the  sex  of  the  source 
had  a  greater  effect  than  the  word  usage  factor  on  the  number  of 
requests  for  information.)  Second,  the  self-limited  subjects  generated 
far  fewer  of  the  four  remaining  types  of  requests  than  did  the  unlimited 
subjects.  However,  as  can  be  seen  from  the  percentages  shown  in  Table  7, 
some  types  decreased  more  dramatically  than  others. 

The  greatest  decreases  are  for  the  seekers'  requests  for  new 
information  and  the  sources'  requests  to  confirm  the  partner's  under¬ 
standing  or  status.  Apparently  requests  of  these  types  are  a  normal 
feature  of  unrestricted  dialogue  that  are,  for  the  most  part,  unneces¬ 
sary  for  the  exchange  of  factual  information. 

Sex  of  Seeker  by  Sex  of  Source  by  Anonymity 

The  multivariate  F  for  the  sex  of  seeker  by  sex  of  source  by 
anonymity  interaction  was  significant,  but  barely  so  (p.  <  0.049).  The 
only  significant  univeriate  F  test  was  on  the  questionnaire  ratings  of 
sources'  brevity.  In  describing  the  nature  of  this  three-way  interac¬ 
tion  it  is  more  convenient,  and  just  as  accurate,  to  think  of  it  as  a 
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simple  two-way  interaction:  the  anonymity  factor  by  the  factor  of  lik<- 
versus  mixed-sex  teams. 

In  the  like-sex  conditions  (i.e.,  all-maie  and  all-female)  the 
Introduced  sources  received  a  lower  average  rating  than  did  the  anony¬ 
mous  sources  (4.2  versus  5.2  on  the  seven-point  rating  scale).  On  the 
other  hand,  the  sources  on  the  introduced  mixed-sex  teams  received 
higher  ratings  than  their  anonymous  counterparts  (5.2  versus  4.7). 

1  am  unable  to  offer  what  I  would  regard  as  a  satisfactory 
explanation  for  this  interaction.  None  of  the  other  14  univariate 
analyses  reached  significance  for  this  source  of  variation,  so  it  is 
impossible  to  determine  a  firm  basis  that  would  account  for  the  sources 
being  rated  differently  in  the  different  conditions.  Because  the  multi¬ 
variate  F  barely  reached  significance,  and  because  only  one  of  the  uni¬ 
variate  F's  reached  significance  (again  just  barely),  I  am  inclined  to 
believe  that  this  was  simply  a  chance  effect. 

Job  Rotes 


There  are  two  significant  main  effects  for  job  roles  in  addi¬ 
tion  to  the  seven  discussed  in  the  Word  Usage  section.  These  are  for 
the  questionnaire  ratings  of  the  seekers'  clarity  and  brevity. 

Seekers'  clarity.  In  rating  the  seekers'  clarity  on  the  seven- 
point  rating  scale,  seekers  gave  themselves  an  average  rating  of  5.8, 
while  the  sources  gave  them  an  average  rating  of  6.5  (£  <  0.001). 
Unlimited  sources  gave  slightly  higher  ratings  to  seekers  than  did  the 
self-limited  sources  (6.7  versus  6.2),  causing  the  job  roles  by  word 
usage  interaction  to  also  reach  significance  (jj  <  0.05). 

Seekers'  brevity.  In  ratings  of  the  seekers'  brevity,  seekers 
gave  themselves  an  average  rating  of  5.9,  while  receiving  from  the 
sources  an  average  rating  of  6.4  (£  <  0.01). 

Discussion.  In  general,  seekers  did  not  rate  their  clarity  or 
brevity  as  highly  as  did  the  sources.  This  could  simply  be  due  to  mod¬ 
esty  on  the  part  of  the  seekers.  It  is  interesting  to  note  that  there 
were  no  significant  job  roles  main  effects  for  the  ratings  of  the 
sources'  performances,  perhaps  because  the  sources  used  many  more  words 
than  the  seekers,  thereby  providing  a  firmer  basis  for  judgment. 

In  addition,  on  the  average  the  sources  apparently  felt  that 
the  seekers  were  slightly  clearer  in  the  unlimited  than  in  the  self-1  un¬ 
ited  condition.  Recall,  however,  that  this  is  not  reflected  by  a  sig¬ 
nificant  difference  in  the  number  of  requests  for  clarification. 

Int error r  e  l  a  t io ns  of  Depend ent  Me  a isurcs 

The  within-cell  intercorrelation  matrix  for  the  15  dependent 
measures  is  shown  in  Table  8.  Although  the  values  in  Table  8  can  be 
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Within-cell  Intercorrelations  of  the  15  Dependent  Measures 


43 


V  ‘"i/WIty 


&  vt  ‘ 1  ffrVnif'T-  *  *  •' **  * 


Interpreted  as  though  they  are  standard  Pearson  product -moment  coir*  l.i- 
t  ion  coefficients  ( _r)  ,  they  were  computed  in  a  different  manner.  This 
is  because  there  were  16  different  factor-level  combinations,  or  cells, 
in  this  experiment  (see  Table  1),  and  each  of  these  combinations  of 
experimental  factors  had  its  own  influence  on  the  dependent  measures. 
These  influences  are  manifested  by  the  differences  among  tfn  cell 
means.  (These  differences  among  cell  means  are,  of  course,  what  is 
being  measured  by  analysis  of  variance.)  Clearly,  the  differences 
among  the  cell  means  can  bias  the  computed  intercorrelations  ol  the 
dependent  measures. 

The  computing  of  correlation  coefficients  has  as  one  of  the 
steps  the  conversion  of  all  raw  scores  to  standard  scores  (or  /.-scores) 
In  order  to  eliminate  the  influences  of  the  experimental  factors  on  the 
dependent  measures,  1  did  not  compute  the  r's  in  Table  8  us  in/,  stan¬ 
dard  scores  that  had  been  based  on  the  grand  mean  ot  each  distribution; 
rather,  I  based  the  standard  scores  on  their  corresponding  cell  means. 

This  procedure  compensated  for  the  differences  in  the  cell 
means,  hut  at  a  cost  of  statistical  power,  as  measured  by  degrees  ol 
freedom.  Typically,  r's  have  for  their  degrees  of  freedom  the  total 
number  of  pairs  of  observations,  minus  two  degrees  of  freedom,  .  n<  >1 
which  is  for  the  mean.  In  this  case,  lb  different  means  were  used 
(one  for  each  cell),  and  each  one  cost  a  degree  of  freedom.  The  total 
number  of  degrees  of  freedom  for  the  r's  in  Table  8  therefore  is  31, 
which  equals  the  number  of  teams  (48) ,  minus  the  number  of  cells  (16), 
minus  one. 

Given  that  the  r's  in  Table  8  have  31  degrees  of  freedom,  their 
absolute  values  must  therefore  exceed  0.344  to  be  significant  beyond 
the  0.05  level,  and  0.443  to  be  significant  beyond  the  0.01  level.  Out 
of  the  105  correlation  coefficients  shown  in  Table  8,  31  are  signifi¬ 
cant  beyond  the  0.05  level,  16  of  which  are  significant  beyond  the  0.01 
level.  This  very  large  proportion  of  significant  correlations  confirms 
the  importance  and  value  of  using  a  MANOVA.  However,  the  information 
contained  in  Table  8  does  more  than  simply  justify  the  use  of  MANOVA. 

It  provides,  for  the  first  time  in  a  Hopkins  telecommunications  experi¬ 
ment,  an  indication  of  which  of  the  dependent  measures  are  correlated 
with  each  other  and  which  are  independent.  There  were  some  surprises. 

As  seems  intuitively  reasonable,  time  to  solution  was  posi¬ 
tively  and  significantly  correlated  with  both  number  of  words  and  num¬ 
ber  of  messages.  However,  what  was  unexpected  is  that  the  number  of 
words  and  the  number  of  messages  are  almost  entirely  independent  of 
each  other.  Message  length  (simply  the  number  of  words  divided  b\  the 
number  of  messages)  is  independent  of  time  to  solution,  but,  as  would 
be  expected,  it  is  s ign if icantly  positively  correlated  with  number  of 
words,  and  strongly  negatively  correlated  with  number  of  messages. 
Communication  rate  (number  of  words  divided  by  time  to  solution)  is 
positively  correlated  with  both  number  of  words  .md  message  length. 
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So  what  does  all  this  tell  us?  It  tells  us  that  teams,  when 
they  take  longer,  tend  to  use  more  words  and  more  messages  than  teams 
that  did  not  take  as  long.  However,  just  because  one  team  may  have 
used  more  words  than  another,  it  does  not  necessarily  mean  that  it 
also  used  more  messages.  This  is  because  the  average  message  length 
tends  to  increase,  and  not  the  number  of  messages.  As  a  corollary  to 
this,  when  a  team  does  use  a  greater  number  of  messages,  the  messages 
tend  to  be  shorter.  Further,  the  teams  that  tended  to  use  more  words, 
and  therefore  longer  messages,  also  tended  to  have  a  faster  communica¬ 
tion  rate. 

Turning  to  the  requests  for  information,  the  teams  with  the 
higher  number  of  requests  for  clarification  also  tended  to  take  longer 
and  use  more  words  and  messages.  None  of  the  other  types  of  informa¬ 
tion  requests  were  significantly  correlated  with  time  to  solution  and 
number  of  words,  although  all  of  them  were  significantly  correlated 
with  the  number  of  messages.  This  is  hardly  surprising  in  light  of 
the  fact  that  each  of  these  requests  frequently  constituted  a  message. 

There  are  some  interesting  points  to  be  made  with  regard  to  the 
questionnaire  data.  Of  the  15  correlations  of  the  five  questions  with 
time,  words,  and  messages,  14  are  negative.  This  means  that  the  more 
time,  words,  and  messages  a  team  used,  the  lower  their  questionnaire 
ratings  tended  to  be.  Further,  the  more  requests  for  clarification  and 
requests  to  confirm  own  understanding  there  were,  the  lower  the  ratings 
tended  to  be,  especially  with  regard  to  the  requests  for  clarification. 
It  is  interesting  to  note,  however,  that  the  number  of  the  other  types 
of  requests,  as  well  as  message  length  and  communication  rate,  are 
uncorrelated  with  the  subjects'  perceptions  of  performance. 

Finally,  the  response  to  each  question  was  significantly  and 
positively  correlated  with  the  responses  given  to  the  adjacent  ques¬ 
tions  on  the  questionnaire.  This  suggests  that  the  answer  given  to  a 
question  may  have  been  affected  by  the  answer  given  to  the  previous 
one,  and  that  I  should  have  used  different  forms  of  the  questionnaire, 
with  the  questions  in  a  different  order  on  each  form. 

Mistakes  Made 


The  mistakes  I  made  with  this  experiment  were,  for  the  most 
part,  errors  of  omission.  There  were  some  potentially  useful  and 
easily  obtainable  dependent  measures  I  failed  to  examine  simply  because 
their  use  had  not  occurred  to  me  in  advance. 

As  has  been  previously  discussed,  useful  information  might  have 
been  obtained  by  keeping  track  of  the  number  of  times  the  seeker 
improperly  assembled  and  then  disassembled  the  model.  Also,  a  more 
accurate  measure  of  communication  rate  could  have  been  obtained  had  I 
kept  track  of  the  subjects'  actual  communication  time. 


Finally,  I  noticed  that  some  seekers  turned  off  the  system 
power  immediately  after  assembling  the  model,  while  others  took  as 
much  as  en  minutes  examining  the  model  and  asking  questions  before 
deciding  that  they  were  done.  I  have  no  records  to  indicate  whether 
the  seekers  took  longer  in  some  conditions  than  in  others  to  examine 
their  finished  models.  Further,  not  only  might  there  have  been  differ¬ 
ences  among  the  conditions  in  how  much  time  the  seekers  took  to  check 
out  their  completed  models,  but  there  might  also  be  differences  in  the 
lumber  of  words,  messages,  and  requests  for  information.  This  might 
have  proven  to  be  extremely  valuable  data.  For  example,  based  solely 
on  the  data  1  did  collect,  it  is  possible  that  the  unlimited  and  the 
self-limited  team  did  not  differ  significantly  while  assembling  the 
model;  however,  as  soon  as  the  model  was  assembled,  the  unlimited  teams 
took  the  time  to  make  sure  it  was  correct,  while  the  self-limited 
teams,  not  wanting  to  waste  words,  stopped  working.  Although  I  am 
reasonably  certain  that  this  is  not  what  happened,  I  cannot  prove  it. 

Thus,  l  feel  that  the  mistakes  1  made  in  conducting  this 
experiment  were,  for  the  most  part,  my  failures  to  recognize,  except 
in  retrospect,  the  need  for  the  above-mentioned  dependent  measures. 

Dif  f  icuit ies  Encountered 

I  encountered  only  one  major  difficulty  while  conducting  this 
experiment;  I  had  an  extremely  difficult  time  obtaining  subjects, 
especially  women.  There  were  2016  full-time  undergraduate  students  at 
Johns  Hopkins  the  semester  I  conducted  my  experiment,  1419  men  and  597 
women.  Finding  48  men  who  were  willing  to  participate  in  the  experi¬ 
ment  was  a  nuisance;  finding  48  women  proved  to  be  a  real  challenge. 

Although  1  had  no  trouble  finding  subjects  when  I  first  began 
the  experiment,  toward  the  end  I  was  forced,  among  other  things,  to 
interrupt  classes  being  taught  by  fellow  graduate  students  (with  their 
permission,  of  course)  to  ask  for  volunteers.  I  continually  put  up  my 
advertising  posters  wherever  on  the  campus  I  thought  they  might  do  some 
good,  even  to  include  the  shower  rooms  in  the  women's  dormitories.  In 
short,  I  had  an  extremely  difficult  time  making  sure  that  I  had  enough 
subjects,  especially  women.  I  am  convinced  that  it  would  have  been 
impossible  for  me  to  conduct  an  experiment  at  Hopkins  that  needed  many 
more  than  the  48  women  mine  used. 

I  also  encountered  a  couple  of  minor  problems.  Early  in  the 
experiment  I  found  that  the  subjects  had  a  tendency  to  forget  either 
the  time  or  the  room  number  I  had  assigned  to  them.  What  I  wound  up 
doing  was,  after  scheduling  a  subject  by  phone,  putting  a  note  in  his 
or  her  campus  mailbox  telling  the  subject  when  and  where  to  report  for 
the  experiment.  The  notes  also  included  my  name  and  campus  telephone 
number.  This  cut  down  on  absenteeism  tremendously.  Also,  those  who 
found  that  they  could  not  come  at  the  assigned  time  often  called  me 
well  enough  in  advance  for  me  t o  conveniently  reschedule  them. 
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The  second  minor  problem  involved  the  anonymous  teams.  In 
talking  to  the  subjects  during  their  pre-session  briefings,  I  had  to 
remember  to  refer  to  their  partners  as  "your  partner"  and  not  by  pro¬ 
nouns  such  as  he  or  she.  It  took  a  few  sessions  before  I  could  do 
this  smoothly. 

So,  to  sum  up,  I  feel  that  the  only  major  difficulty  involved 
with  conducting  this  experiment  was  due  to  the  small  size  of  the 
available  subject  pool,  causing  me  to  expend  a  tremendous  amount  of 
effort  obtaining  volunteers.  The  minor  difficulties  involved  getting 
the  subjects  to  remember  to  show  up,  and  getting  me  to  remember  not 
to  accidentally  give  extra  information  to  the  anonymous  subjects. 


Conclusion 

In  this  section  I  shall  first  quickly  summarize  the  important 
findings  of  my  study,  and  then  discuss  where  these  findings  lead  with 
regard  to  potential  areas  for  future  research. 

Summary 


One  of  the  purposes  of  this  experiment  was  to  determine  whether 
or  not  there  is  a  mismatch  in  the  way  men  and  women  communicate  to 
exchange  factual  information  and  solve  problems.  No  evidence  of  such  a 
mismatch  was  found,  suggesting  that  the  difference  in  performance 
between  like-  and  mixed-sex  teams  in  previous  studies  of  this  type  was 
due  to  the  confounding  influences  of  non-verbal  communication.  From 
an  applied  standpoint,  this  is  strong  evidence  that  natural  language 
computers  will  not  have  to  adjust  themselves  to  the  sex  of  their  user. 

Further,  whether  the  subjects  were  previously  introduced  or  not 
apparently  had  no  effect  on  their  being  able  to  work  together  as  a 
team . 


There  were  two  major  findings  in  this  study  in  which  condition- 
attributable  differences  were  demonstrated.  The  first  is  that  the  sex 
of  the  person  receiving  instructions  could  not  be  shown  to  have  any 
effect  on  the  team's  performance,  although  performance  was  affected  by 
the  sex  of  the  person  giving  instructions.  The  females  proved  to  be 
superior  on  all  measures  where  there  was  a  significant  difference.  1 
believe  that,  for  the  most  part,  this  can  be  attributed  simply  to  the 
females  being  more  thorough  and  careful  than  the  males. 

The  second  major  finding  is  that  when  the  subjects  in  this 
study  had  an  incentive  to  cut  down  on  their  word  usage,  they  not  only 
used  fewer  words,  fewer  messages,  and  averaged  shorter  message  lengths 
than  the  unlimited  subjects,  but  they  also  solved  their  problems 
faster.  I  believe  that  the  major  reason  for  this  finding  is  that  the 
self-limited  subjects,  in  general,  composed  and  carefully  edited  their 


message  .  |ii  loi  1 1'  transmission,  whllo  the  tin  1  ini  i  t  oil  sub  joct  ..  t  ended  to 
compost-  t  lit- i  i  mo  s  sagos  "on  lino." 

Knt  in  o  Rosea  i  eh 

Al  thi>  null  this  experiment  .tnswoi  oil  t  bo  appl  (oil  quest  foils  it  had 
boon  intended  t  o  .in  swot  .  it  b.i  .  t.ii-ed  many  now  thooiot  io.il  quest  ions 
th.it  i  .in  bo  answered  on  1  v  by  tnrthci  exp»-r  intent  at  ion  . 

With  i  ot-,. nil  to  the  I  hidings  on  sox  d t t f c renoes :  llow  robust  aro 
t  liov  ’  In  othoi  words,  .no  |  om.i  i  o  .  generally  superior  .it  giving  Inst  tui¬ 
tions.’  t't  aro  thoro  ooitaln  kinds  “I  problems  on  which  tna  1  os  would  bo 
superior  It  so,  what  kind,  ol  problems.’  Further,  are  there  certain 

kinds  ot  problems  that  would,  unlike  tin-  problem  I  used,  result  in  a 
oommun  i  oat  io1';:  nilsm.it  eh  between  the  sexes’  II  so,  ay,  a  111  what  kinds  ol 
pi  ob  loins.’  there  are  many  many  more  quest  ions  in  addit  ion  to  these. 

Iho  point  is  that  tin-  experiment  was  auvthlny,  hut  the  dot  init  Ivo  t  hoo 
tot  teal  work  on  sox  d  i  I  termers  in  Intoraetlvo  verbal  l  oloeommun  lea 
t  ion,  hut  at  loast  it  was  a  -it  at  t  . 

Vho  I  hidings  with  reyard  t o  sell  limited  versus  unlimited  woid 
usage  also  suggest  the  need  tot  lurthoi  research,  ospool.illv  bi-eausi- 
.'I  the  possibility  that  the  sol  l  - 1  trailed  teams  wete  tastet  than  the 
mil im i ted  because  thelt  method  ot  eommuuleat ton  was  less  natural. 

Ktemmei  (,ld/l),  In  comparing  l  lie  merits  ot  standard  versus  video-sup¬ 
plemented  telephones,  warned  against  assuming  that  t  hi-  mote  natural 
the  method  ot  t  e  I  eeomtmtu  lent  Ion,  the  better  the  method  is.  Unt  il  tn'W, 
to  the  host  ot  mv  knowledge,  no  laboratory  oxpoi  lim-ut  has  actual  !y 
demonstrated  a  slgnllle.int  linpro  vement  in  eltteleney  attributable  to  a 
decrease  in  the  eonvenionee  ot  eomiminle.it  ion  within  the  same  mode 
(e.g.,  t  e  let  vpi'wr  i  t  e  rl  .  However,  1  believe  that  my  sell  limited  teams 
wore  taster  than  the  unlimited  at  least  in  paid  because  ot  thelt 
fnabtl  Itv  to  eommuti  (eat  e  In  a  natural  manner.  Further,  the  mixed  sex 
teams  wete  as  ottielont  as  the  l  ike-  sox  almost  eerialnly  because  ot  the 
absence  ot  the  non  verbal  capacity  that  Is  available  In  the  more  natural 
eommuuleat  Ion  modes.  I  then-tore  tool  that  some  hone!  it  may  hi-  do  i  i  vod 
tom  I  at ut o  research  Into  how  "natural"  a  eommuuleat  ion  system  should  hi 
in  each  situation  In  ordt-i  to  exchange  Information  In  the  most  ottielont 
mantlet  .  Mv  t  hidings  stiyy.i-st  that  bv  following  this  course,  we  might  ,  a- 
l.iekl  id, -i  et  at  .  lllh,8l  predicted,  "he  able  to  oomimiuie.it,-  mote  el  lee 
tlvolv  through  a  m.iehine  than  laei-  to  lace." 
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